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THE PERFORMANCE OF PSYCHIATRISTS AND PSYCHOLOGISTS 
IN A THERAPEUTIC INTERVIEW! 


HANS H. STRUPP 
University of North Carolina* 


INTRODUCTION 


In the course of psychotherapy it is ordinarily not possible to make systematic 
observations about the effects of therapist variables upon the treatment process, yet 
there is little doubt that the totality of the therapist’s personality—his age, sex, 
experience, maturity, attitudes, and a host of other factors—is subtly intertwined 
with his therapeutic techniques and the theoretical framework he brings to bear 
upon his therapeutic operations. It is also apparent that the scientific study of the 
therapeutic process requires precise information concerning the relative contribu- 
tions of these variables to answer the central question of therapeutic change. We 
cannot make any definitive statements about the supposed effects of technique A or 
technique B as long as the person of the therapist is omitted from consideration. 
Wolberg’s statement, that therapists ‘seem to do well or poorly because of subtle 
personality factors rather than because of their particular orientation” “: »- "° may 
be correct or an overstatement, but the answer will not be forthcoming until we can 
assess the relative influences of the relevant variables. This goal, unfortunately, lies 
far in the future. 

On a less ambitious level, it appears reasonable to inquire how different thera- 
pists approach a patient during an initial interview, what impressions they form, how 
they conceptualize their observations, and how they respond to the patient facing 
them. By taking such data as a starting point, one might be able to investigate the 
extent to which variations in therapist performance are determined by variables in 
his background, experience, and personality. This focus on the therapist, rather than 
the more usual one on the patient, characterized the investigation of which this 
paper forms a part. 

A series of results based upon the performance of 134 psychiatrists has been re- 
ported in an earlier paper) which gives a more detailed description of the rationale, 
procedure, and results, and discusses their implications at greater length. The present 
article presents a number of parallel analyses based on the performance of a group of 
therapists whose background training was in psychology rather than in medicine. 


METHOD AND PROCEDURE 


Briefly, the experimental procedure consisted of a sound film of an initial inter- 
view in which a neurotic patient was being interviewed by an inexperienced resident 
in psychiatry. The motion picture was adapted for experimental purposes by intro- 
ducing 28 interruptions (each 30 seconds in length) at preselected points. It was 
shown to groups of psychotherapists who were asked to assume the role of vicarious 
interviewers and to record in writing what they would have said to the patient. 
Following the film showing, therapists completed a comprehensive questionnaire on 
their diagnostic impressions, treatment plans, goals, etc. 

Complete data were collected from 235 therapists in several cities. A previous 
paper®? dealt with the responses of 134 psychiatrists (analysts, psychiatrists, and 
residents in psychiatry). In another article“, the performance of 64 psychoanalytic- 
ally oriented psychologists was compared with that of 14 psychologists following the 
client-centered theory of Rogers. This paper presents comparisons between 55 psy- 
chiatrists and 55 psychologists, matched on length of experience in psychotherapy 


1Presented at the 1957 meeting of the American Psychological Association. 

*This research was supported by a research grant (M-965) from the National Institute of Mental 
Health, Public Health Service, and carried out at George Washington University, School of Medicine. 
I wish to acknowledge my appreciation to Winfred Overholser, M. D., and Leon Yochelson, M. D., for 
encouragement and advice. 
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and personal analysis (Table 1). In addition, reference will be made to certain paral- 
lel analyses performed separately for psychiatrists and psychologists. 


TasBLe 1. SumMMARY OF BIOGRAPHICAL VARIABLES FOR MATCHED SAMPLES OF 
PSYCHIATRISTS AND PsYCHOLOGISTS 








Length of Personal 
Experience Analysis Age 
y nee (hours) (years) 
8.D. M 8.D. 8.D. 





Psychiatrists - Analyzed 
Psychologists - Analyzed 
Psychiatrists - Nonanalyzed 
Psychologists - Nonanalyzed 
All Psychiatrists 

All Psychologists 


Lv 


322 228 
319 210 


[Nw 
ROW me Oro 


(322) (228) 
(319) (210) 


NOnW Ran 


WWD Pp 
NIT WOO < 
00 D> O11 00 


ww 





RESULTS 
To facilitate comparisons between the results obtained for psychiatrists and 


psychologists, the order of presentation follows that of the first article @. 


Diagnosis. Like psychiatrists, analytically oriented psychologists chose a variety of diag- 
nostic labels, ‘‘anxiety’’ being the preferred diagnosis. The response distributions of the matched 
samples did not differ significantly. 

Defense Mechanisms. Psychologists mentioned almost the same number of defense mechan- 
isms as psychiatrists (an average of about 3). Chi square did not indicate statistically significant 
differences between the two samples. 

Formulation of Dynamics. The kinds and quality of dynamic formulations, as defined pre- 
viously, did not differentiate the matched samples. In fact, the distributions disclosed a marked 
degree of similarity. 

Other Diagnostic Evaluations. A number of diagnostic assessments were obtained through 
precoded items. These included: Ego Strength, Anxiety, Insight, Capability of Self-Observation 
and Self-Appraisal as opposed to the Tendency for Rationalization, Emotional Maturity, Social 
Adjustment, Disturbance. The evaluations of psychiatrists and psychologists appeared to be 
very similar. No chi square value was significant at an acceptable level. 

Prognosis. While respondents differed in their prognostic estimates, differences between 
psychiatrists and psychologists were statistically nonsignificant. 

Attitude toward the Patient. The matched samples were undifferentiated in terms of their atti- 
tudes toward the patient (self-ratings on a five-point item); nor did psychiatrists differ from psy- 
chologists regarding the reasons given for their professed conscious attitude. 


Analysis of Systematic Differences. In order to investigate further the possible 


reasons for differences in therapists’ evaluations the full psychiatrist sample (N = 
134) was successively broken down according to a number of therapist variables. 
The results of these analyses may be summarized as follows: 


1. Length of experience appeared to have a negligible bearing on therapists’ evaluations. 
Significant but moderate correlations were observed between experience and estimates of Ego 
Strength and quality of the dynamic formulation, that is, the more experienced therapists estim- 
ated the patient’s ego strength to be less, and they contributed a larger proportion of “superior” 
formulations. 

2. Similarly, older therapists judged the prognosis without therapy to be more unfavorable. 

3. Breakdowns in terms of theoretical orientation and personal analysis were inconclusive, 
chiefly because of difficulties in isolating ‘‘pure’’ groups. 

4. A number of statistically significant correlations obtained between therapists’ attitude 
toward the patient and clinical evaluations: therapists indicating a negative attitude were more 
likely to choose such diagnostic labels as psychopath, character disorder, paranoid, and phobic. 
They also tended to see the patient as less anxious, having less insight, being less capable of self- 
observation, more immature emotionally, and more poorly adjusted socially. 

5. A statistically significant correlation was observed between prognosis and therapists’ 
attitude (r = .39) which signified that therapists having a positive attitude toward the patient 
tended to give a more favorable prognosis. Prognostic estimates were also shown to be correlated 
with other clinical evaluations, and followed a pattern similar to that reported for therapists’ 
attitude. 
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In order to determine the extent to which these relationships were paralleled in 
the sample of psychologists it would have been desirable to conduct the analyses on 
the matched samples. Unfortunately, the relatively small size of these samples 
rendered this unfeasible. As a compromise, the full sample of psychologists was used, 
In evaluating the results it must be kept in mind that the psychiatrist sample was 
notably larger (134 vs. 64), more experienced (mean of 6.8 years vs. 3.1 years), and 
somewhat older (38.2 years vs. 35.6 years). In addition, the range of experience, as 
indicated by the standard deviation, was twice as great for psychiatrists as for psy- 
chologists. Despite these shortcomings it seemed appropriate to report the results 


TaBLeE 2. Propuct-MoMENT COEFFICIENTS OF CORRELATION BETWEEN ATTITUDINAL AND 
BIoGRAPHICAL VARIABLES—AVERAGE N (PsyYcHoLoaists) = 








Prognosis Attitude Experience Experience Competence Age 
Variable with toward (in years) (Self- as 
Therapy Patient estimate) Therapist 


Ego Strength .26* .08 ol" 23 — .09 .19 
Anxiety 24 .23 .23 16 — .06 10 
Insight ia 18 24 .29* — .22 12 
Self-Observation 

vs. Rationalization .29* .28* 15 13 .07 .02 
Emotional Maturity — .13 an mi 05 .00 19 
Social Adjustment 17 .06 .30* 26° .O1 2a° 
Disturbance — .31* 10 21 .3o"* 18 .27* 
Prognosis without 

Therapy —.18 .16 .23 .35** .20 .23 
Prognosis with 

Therapy A! By .19 .08 .03 
Attitude toward 

Patient 25° 17 ll .08 
Experience (in 

years) 
Experience (self- 

estimate) 
Competence as Thera- 

pist (self-estimate) 


*Significant at the .05 level 
**Significant at the .01 level 


for two reasonably large groups of therapists participating in this investigation. The 
layout of table 2 is directly comparable to the one presented earlier for the tull 
psychiatrist sample. The results for the full psychologist sample may be summarized 
as follows: 

1. As in the psychiatrist sample, length of experience was significantly correlated with 
Ego Strength, and quality of dynamic formulation (chi square significant at the .01 level). In 
addition, however, experience correlated significantly with Social Adjustment and Attitude to- 

ward the Patient. If self-estimates of experience rather than number of years were taken as the 
reference variable, Insight, Disturbance, and Prognosis without Therapy also disclosed significant 
correlations. Increasing experience tended to cov ary with the more unfavorable estimates. 

2. Therapists’ age correlated significantly with estimates of the patient’s Social Adjustment 
and Disturbance, the direction of the correlation being similar to that reported in the preceding 
paragraph. 

3. Breakdowns in terms of personal analysis and theoretical orientation were unfeasible 
for the psychologist sample as well. 

4. Correlations between therapists’ attitude and clinical evaluations appeared to be of a 
somewhat lower order than that re — for psychiatrists. Statistical significance was obtained 
in the case of Self-Observation vs. Rationalization and quality of the dynamic formulation. 

5. Asin the case of psychiatrists, prognosis correlated significantly with therapists’ attitude 
(r = .42). Prognostic estimates were also found to be significantly correlated with such clinical 
evaluations as Insight, Self-Observation vs. Rationalization, and Disturbance. 








While the correlations for psychologists did not disclose a one-to-one relation- 
ship to those reported for psychiatrists, the pattern must be judged rather similar if 
allowance is made for inevitable sampling fluctuations and the sample differences 
already mentioned. 
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Therapeutic Goals and Approaches. The distributions of responses given by the 
matched samples of psychiatrists and psychologists differed significantly in terms of 
therapeutic goals (p < .01). Psychiatrists tended to specify more modest goals, such 
as symptom relief, whereas a larger proportion of psychologists stressed greater in- 
sight, greater self-acceptance, etc. Virtually no member in either group envisaged an 
extensive reorganization of the patient’s personality. 

Responses did not differ significantly on the following questionnaire items: 
Areas in which therapists expected the patient to “move” and areas in which they 
expected him to remain relatively unchanged; areas or problems which might prove 
most amenable to therapy; areas in the patient’s living which might receive primary 
focus in treatment; attitudes or kinds of behavior which the therapist would en- 
courage (or discourage) in therapy with this patient; areas which should be avoided 
(early) in therapy. 


Problems of Therapeutic Technique. Psychiatrists appeared to differ very little from 
psychologists on a number of questions including: clues derived from the patient’s 
behavior during the interview; descriptions of the course of the transference and its 
technical handling; anticipation of countertransference reactions and methods of 
dealing with them; difficulties and/or dangers which might arise in the course of 
therapy; approach to the patient’s phobic symptoms; the desirability of making 
recommendations to the patient to change his present mode of living or his environ- 
ment; therapist’s activity or passivity. 

The matched samples diverged significantly with regard to their expectations of 
acting out behavior and its therapeutic handling. A larger proportion of psychia- 
trists expected the patient to engage in this form of transference reaction (p < .02); 
they were also more likely to state that they would actively discourage its develop- 
ment by firmness, strictness, setting limits, etc. (p < .05). Responses to a question 
dealing with the desirability of strictness or permissiveness very probably reflected 
the same line of thinking in that psychiatrists advocated strictness more than their 
psychologist colleagues (p < .01). Similarly, psychiatrists tended to be more out- 
spoken about discouraging free association with this patient, in contrast to a larger 
proportion of psychologists who explicitly encouraged it (p < .01). 


Form of Therapy, Frequency of Sessions, and Length. Comparisons of the response 
distributions for the matched samples indicated that psychologists tended to ad- 
vocate the more intensive forms of psychotherapy with this patient than psychia- 
trists (p < .05). There was no statistically significant difference regarding the fre- 
quency of weekly sessions, but the trend was for psychiatrists to prefer one weekly 
interview. There was a slight tendency for psychiatrists to estimate the duration of 
psychotherapy to be shorter than psychologists (p < .10): more psychiatrists than 
psychologists estimated the length of treatment to be less than one year. 

The distributions did not differ significantly regarding the time required to 
achieve symptom relief. A question relating to possible alternatives to the re- 
spondent’s first choice of treatment elicited a somewhat larger proportion of recom- 
mendations for pharmacological therapy (presumably tranquilizing drugs) from 
psychiatrists, but differences in the over-all distributions were nonsignificant. 


Analysis of Systematic Differences. The results for breakdowns of the full 
psychiatrist sample in terms of selected therapist variables, reported previously, 
may be summarized as follows: 


1. The responses of psychiatrists, broken down by experience level, revealed almost no 
statistically significant differences, with these exceptions: Analysts (the most highly experienced 
group) judged the length of therapy to be longer than either psychiatrists or residents. Analysts 
preferred 3 or more weekly sessions, in contrast to psychiatrists who showed a predilection for 1 
or 2 weekly interviews. The more experienced therapists tended to characterize the course of the 
transference in terms of interpersonal dynamics rather than in descriptive terms, which were pre- 
ferred by the less experienced psychiatrists. 

2. The therapist’s conscious attitude toward the patient was found to be significantly asso- 
ciated with his responses pertaining to possible recommendations to the patient regarding his 
mode of living; with strictness vs. permissiveness; activity vs. =: si of therapy; fre- 
quency of sessions; and the time required to achieve symptom relief. 
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Comparable analyses, for the full psychologist sample, disclosed the following 
results: 

1. A breakdown in terms of experience produced very few statistically significant differ- 
ences, with one notable exception: experienced therapists were less likely to recommend the more 
intensive forms of psychotherapy (p < .05). 

2. Attitude toward the patient was significantly associated with strictness vs. permissive- 
ness; anticipations of acting out; and discouragement of certain attitudes and behaviors. Stated 
otherwise, therapists expressing a negative attitude toward the patient tended to advocate 
strictness on the therapist’s (p < .01); were more likely to expect acting out behavior 
(p < .09); and were more likely to discourage oe self-depreciation, self-punishing behavior, 
refusal to accept responsibility, and projections of hostility (p < .06). 

3. Therapists giving a negative prognosis were more likely to expect the patient to act out 
than therapists whose prognosis was favorable. 


Therapists’ Communications (Film Responses). Therapists’ communications to the 
patient (film responses) were quantified by means of a system of analysis described 
previously“). The distributions obtained under the various components (type of 
therapeutic activity, depth-directedness, dynamic focus, initiative, and therapeutic 
climate) were then compared by means of the nonparametric Mann-Whitney U test 
to determine the degree of divergence between the two matched samples. These 
results may be summarized as follows: 
1. Psychiatrists differed from psychologists by asking a significantly larger number of ex- 
ploratory questions (p < .01). 
2. Psychologists in turn showed a relatively greater “arena ga for the reflection-of-feeling 
technique usually associated with Rogers’ client-centered theory (p < .001). 
3. Psychologists tended to change the dynamic focus less than psychiatrists and were more 
likely to accept the patient’s frame of reference (p < .002). 
. The two matched samples were not differentiated in terms of the relative frequency of 
“silent’’ responses, communications in which the therapist emerges as an expert or an authority, 
highly inferential communications, communications changing the dynamic focus to interpersonal 
events of the present or the past, communications showing a high level of initiative, and“warmth’”’ 
or ‘“‘coldness’”’ of emotional tone. 


ANALYSIS OF SYSTEMATIC DIFFERENCES 
Level of Experience. As in the preceding sections, the results obtained from 
breakdowns of the full psychiatrist sample were compared with corresponding an- 
alyses based on the full psychologist sample. For the psychiatrist sample, the follow- 
ing findings appeared to be attributable to length of experience in psychotherapy: 

1. Inexperienced psychiatrists tended to ask more exploratory questions than experienced 
therapists. 

2. Experienced psychiatrists tended to give a larger number of interpretive responses than 
inexperienced practitioners. Related analyses disclosed that these communications were more 
inferential, and that their dynamic focus concerned dynamic interpersonal events in the patient’s 
past and present life. 

3. Experienced therapists showed a higher degree of initiative in their communications than 
inexperienced practitioners. 

4, Concomitant with the above findings, experienced therapists tended to change the 
dynamic focus of their communications more than the less experienced respondents. 


5. Experienced therapists tended to be slightly ‘‘warmer’’. 


Most differences between inexperienced and experienced psychologists were 
statistically nonsignificant. However, communications in which the therapist emerg- 
ed as an expert or as an authority tended to increase with experience (p < .05), 
and ‘“‘warm”’ responses showed increments in the same direction (p < .05). 


Personal Analysis. Psychiatrists whose training had included a personal an- 
alysis differed from nonanalyzed psychiatrists, with whom they were matched in 
terms of experience (N in each sample was 28), by giving a significantly larger num- 
ber of silent responses at the expense of exploratory questions. This difference ap- 
peared to be more pronounced at the lower end of the experience continuum. Two 
similarly matched samples of psychologists (N in each sample was 33) were statistic- 
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ally undifferentiated in terms of their film response distributions on all components 
of the system of analysis. 

However, comparable analyses between psychiatrists and psychologists (see 
Table 1) disclosed that analyzed psychiatrists tended to exceed analyzed psycholo- 
gists in “‘silent’’ responses (p < .11) and exploratory questions (p < .09). Psycholo- 
gists, on the other hand, showed a consistent preference for reflections of feeling. A 
comparison between nonanalyzed psychiatrists and nonanalyzed psychologists cor- 
roborated the differences regarding explorations and reflections of feeling, but not 
regarding silence. This suggests that analyzed psychiatrists tended to be the relative- 
ly most “‘silent’’ of the samples, and that the other variations between psychiatrists 
and psychologists do not seem to be a function of the personal analysis variable. 


Theoretical Orientation. A comparison between two samples of psychiatrists 
whose respective theoretical orientations were orthodox Freudian and neo-Freudian 
failed to disclose any statistically reliable differences in their film responses. A 
similar comparison for psychologists proved unfeasible because of small N’s for 
“pure” subgroups. Certain differences between analytically oriented psychologists 
and those following the client-centered theory of Rogers have been reported else- 
where ®?, 


Attitude toward the Patient. A breakdown of the psychiatrist sample in terms of 
the therapist’s expressed conscious attitude toward the patient revealed no notable 
differences in techniques per se, but disclosed that therapists indicating a negative 
attitude gave a significantly larger number of responses whose emotional tone was 
rated ‘‘cold’’; ‘‘warm’’ responses did not differ significantly. Psychologists, in con- 
trast, did not reveal statistically reliable differences in terms of therapeutic climate, 
but certain technique differences were observed: 

1. Psychologists expressing a ‘“‘neutral’’ attitude toward the patient asked a significantly 

larger number of exploratory questions (p < .02). 


2. Concomitantly, those expressing a neutral attitude tended to accept the patient’s focus 
least; on the other hand, psychologists professing a positive attitude were more likely to accept 
the patient’s focus (p < .02). 


Prognosis. The following statistically reliable differences were observed when 
the psychiatrist sample was broken down in terms of the respondents’ prognostic 
estimates: 

1. Psychiatrists expressing a noncommittal prognostic estimate tended to give the smallest 
number of silent responses. 

2. Respondents giving a favorable prognosis were more likely to accept the patient’s focus. 

3. Therapists giving a noncommittal prognosis tended to show more initiative in their 
communications than the favorable or unfavorable groups. 


4. Therapists whose prognosis was unfavorable gave more than four times as many “‘cold”’ 
responses than respondents who considered the prognosis favorable. 


5. The relative frequency of “warm” responses was greatest among therapists whose 
prognosis was noncommittal and relatively low in both the favorable and unfavorable groups. 


The analysis for psychologists was not directly comparable because the number 
of respondents giving a noncommittal prognosis was too small. Results based on a 
comparison between respondents giving a favorable and an unfavorable prognosis 
showed that the latter group exceeded the former in the relative frequency of ex- 
ploratory questions (p < .05). 


Exploration of the Empathy Variable. As the analysis of the data progressed, it be- 
came apparent that therapists’ responses to various choice points in the motion 
picture as well as the over-all response patterns differed rather markedly in feeling 
tone, understanding of the patient’s plight, willingness to extend a helping hand, 
etc., — qualities which appeared to transcend the quantifications already discussed. 
This impression was strengthened by the observation that therapists admitting to a 
negative attitude toward the patient tended to be “‘colder’’ in their communications, 
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For these reasons, an attempt was made to define criteria for judging the degree otf 
empathy conveyed by the therapist. 

These criteria have been described in greater detail in the earlier paper. The judgment was a 
simple over-all rating of the therapist’s response behavior at the 28 choice points in the film. If 
the respondent appeared to empathize with the patient, he was given an over-all rating of plus; 
if there was evidence of coldness, aloofness, rejection, etc., a rating of minus was assigned; if the 
record provided insufficient evidence to warrant either a score of plus or minus, a “‘neutral’’ 
rating (0) was given; if the record contained both plus and minus indicators, an “ambivalent’’ 
rating (+) resulted. Conceptually, there appeared to be an affinity between ratings of + and +, 
on the one hand, ard between — and 0, on the other, so that for most subsequent analyses a plus- 
minus dichotomy was used. 

Despite the crudeness of these measures it was possible to demonstrate fair rater agreement. 
Successive ratings by each of two raters showed a consistency ranging from 69% - 85%; inter- 
rater agreement ranged from 64% - 72%. These indices were similar for the psychiatrist and 
nonmedical therapist samples, which were judged separately. The results for the latter sample 
(including 15 psychiatric social workers) are presented in Table 3. 


TABLE 3. AGREEMENT WITHIN AND BETWEEN Two INDEPENDENT RATERS ON OVER-ALL 








Ratines oF Empatuy (N = 93) 
Full Partial Disagreement 
Agreement* Agreementt 

Rater 1: Rating 1 vs. 

Rating 2 72 (77%) 7 ( 8%) 14 (15%) 
Rater 2: Rating 1 vs. 

Rating 2 60 (65%) 13 (14%) 20 (21%) 
Rater 1 vs. Rater 2: 

Rating 1 47 (51%) 14 (15%) 32 (34%) 
Rater 1 vs. Rater 2: 

Rating 2 40 (43%) 27 (29%) 26 (28%) 





Note.—N comprises 64 psychoanalytically oriented psychologists, 14 Rogerians, and 
15 psychiatric social workers. 

* Using four categories: +, —, 0, and +. 

t+ Combining categories + and +, and 0 and —. 


In the psychiatrist sample, empathy ratings showed statistically significant 
associations with therapists’ conscious attitude and prognosis, indicating that 
positive empathy tended to covary with positive attitude and favorable prognosis. 
For the nonmedical therapist sample, comparable analyses revealed a similar trend, 
but chi square was not significant at an acceptable level (.20 < p > .15). There was 
a slight tendency for experienced psychiatrists to have higher empathy ratings 
(p = .13); for nonmedical therapists, chi square was significant at the .09 level. 

When the empathy ratings of psychiatrists whose training had included personal 
analysis were compared with nonanalyzed psychiatrists of comparable professional 
experience, the results were in favor of the analyzed group (p < .07). At the lower 
experience levels (less than 3 years), there appeared to be no difference between 
analyzed and nonanalyzed therapists in terms of empathy ratings; however, at the 
higher levels of experience, analyzed therapists obtained a larger proportion of posi- 
tive empathy ratings. For a similarly matched sample of nonmedical therapists 
(N = 33 in each group, including 29 psychologists and 4 psychiatric social workers), 
the over-all results were statistically nonsignificant, but at the higher levels of ex- 
perience the above mentioned trend appeared to be in evidence (Table 4). 

A further breakdown in terms of the therapist’s conscious attitude toward the 
patient showed that in the experienced analyzed group of psychiatrists, a negative 
attitude appeared to be associated more often with a positive empathy rating, 
whereas in the nonanalyzed experienced group there seemed to be greater congruence 
between negative attitude and lack of empathy. No corroboration for this finding 
was found in the nonmedical therapist sample. It is apparent that the results per- 
taining to the relationships between empathy ratings, personal analysis, and con- 
scious attitude must be viewed with considerable caution, first, because of the very 
small size of the subsamples, and second, because the error ot measurement in each 
variable is undoubtedly compounded when these variables are considered jointly. 
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TABLE 4. DisTRIBUTIONS oF Empatuy RatTINGs AND SELF-RatinGs oF ATTITUDE IN Two Groups OF 
PsycHo.oaists, MATCHED IN TERMS OF PERSONAL ANALYSIS 





Experience Attitude Personal Analysis No Personal Analysis 
in Therapy toward Empathy Rating Empathy Rating 
Patient + _ Total + _ Total 





7 
9 
16 


9 
8 
17 


16 
17 
33 


10 
6 
16 


6 
11 
17 


oa 
0 - 2 years 
Total 


— 
tNostor 





2.5 - 12 years 
Total 

Combined 
Total 


— 
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SUMMARY 


This investigation was undertaken for the purpose of obtaining empirical evi- 
dence on therapists’ performance in an initial interview and to explore the effects of 
relevant therapist variables upon (a) clinical judgments, including diagnostic and 
prognostic evaluations, (b) formulations of treatment plans and goals, (c) communi- 
cations addressed to the patient. Psychotherapists were asked to participate as 
vicarious interviewers while being exposed to a sound film of an unrehearsed initial 
interview. Subsequently, they were asked to record their clinical impressions. Data 
were collected from 235 therapists representing a wide range of experience and train- 
ing. This paper is concerned with the performance of 55 psychiatrists who were 
compared with a matched sample of 55 psychologists. In addition, an attempt was 
made to cross-validate certain analyses based upon the full sample of psychiatrists 
(N = 134) on the full sample of psychologists (V = 64). 

The clinical evaluations of psychiatrists were similar to those of psychologists 
of comparable experience despite marked intragroup differences. In some respects, 
psychologists as a group appeared to be more passively expectant in their therapeutic 
approach than psychiatrists. Psychiatrists’ communications showed a preference for 
exploratory questions whereas psychologists disclosed a predilection for reflections of 
the patient’s feelings. In both groups of therapists, clinical evaluations tended to 
become more unfavorable with increasing experience. In terms of therapeutic tech- 
nique, experienced psychiatrists tended to give a larger number of interpretations 
and showed more initiative. Psychologists did not show this pattern in clear-cut 
fashion. However, experienced therapists in both groups tended to be ‘“‘warmer”’ in 
their communications to the patient. 

Therapists’ attitudes toward the patient, among psychiatrists as well as among 
psychologists, were shown to be correlated with certain clinical evaluations, treat- 
ment plans, and the emotional tone of their communications. In discussing the 
implications of these findings”, attention has been called to the potential effects of an 
interaction between therapists’ emotional reactions to a patient, assessments of the 
“objective” clinical evidence, and the character of the therapeutic interventions. 
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CRITERIA OF THERAPEUTIC RESPONSE IN HOSPITALIZED 
PSYCHIATRIC PATIENTS* 
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INTRODUCTION 


Eysenck’s“ recent review of evidence on the effectiveness of psychotherapy has 
freshly pointed up major inadequacies in existing knowledge concerning the power of 
interviewing techniques for psychiatric treatment. Miles, et al,“ observe that, 
‘‘When the various studies are compared in terms of thoroughness, careful planning, 
strictness of criteria, and objectivity there is often an inverse correlation between 
these factors and the percentage of successful results reported.” It might be assum- 
ed that a large part of the absence or inadequacy of evaluative evidence on psycho- 
therapy stems from the fact that so much of it is conducted on out-patients who are 
not accessible for consistent and frequent evaluative observation and for whom 
acceptable criterion data on personality change are difficult to obtain. By contrast, 
one might assume that more extensive and satisfactory data exist on the results of 
various treatments of hospitalized patients. Careful survey of the literature reveals 
this not to be so. Too frequently the evaluation of effectiveness of procedures such 
as electro-convulsive therapy (ECT) is limited to global categorizations of treated 
patients as “improved”’, “unimproved”’, etc., with such judgments usually limited 
to the clinical appraisals of one or two psychiatrists. The well recognized unreliabil- 
ity of such unstructured, global, clinical appraisals requires no comment. If the 
actual efficacy of psychiatric therapies is to be appraised (with consequent appre- 
ciation of the need for improving the selective application of existing therapies and 
for discovering new treatments), clinical evaluations of the changes associated with 
treatment must be bolstered by regular, objective, repeatable psychometric tests, 
and structured observations and ratings. 

The purpose of this study was to investigate the relative validities and the 
interrelationships of three sets of data bearing on the immediate responses of psy- 
chiatric patients and one measure of long term response to therapy. These data are 
to be examined for two sets of patients, those receiving electro-convulsive therapy 
(ECT) and those not receiving ECT. The four kinds of intormation were: (a) pre- 
and post-therapy MMPI profiles; (b) daily behavior ratings recorded by the nursing 
staff; (c) the usual clinical evaluation at the time of discharge; and (d) data on sub- 
sequent rehospitalization. 


SUBJECTS 


The subjects all had been discharged from the psychiatric service at the Uni- 
versity of Minnesota Hospitals.' The selection of cases rested upon the joint occur- 
rence in the record of the following data or conditions: (a) A period of hospitaliza- 
tion of not less than 30 nor more than 75 days; (b) in the case of patients receiving 
ECT, a course of no fewer than 10 nor more than 25 treatments; (c) a diagnosis of 
involutional psychosis, manic depressive disease, schizophrenia or psychoneurosis; 
(d) the presence of both pre- and post-treatment MMPI profiles; and (e) the avail- 
ability of behavior ratings by the nursing staff extending over the period of hospital- 
ization. 

The major descriptive characteristics of the four diagnostic sub-groups constitut- 
ing the final sample of 100 patients are reported in Table 1. It is obvious in view of 
the selection procedures that these 100 patients do not constitute a random sample 
from all those received and. treated by the Division of Psychiatry. 


*This study was supported by a research grant from the Graduate School of the University of 
Minnesota. 
1Cases reported here overlap the cases reported by Briggs. 
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$ 
TaBLe 1. Mason CHARACTERISTICS OF THE SAMPLE 








Median Sex Days Hosp. Re- | Per Cent* 
Diagnosis N Age F M Range Hosp. Improved 





Patients Not Receiving ECT 





Involutional 1 (26) 
Manic Depressive 2 33 - 66 
Schizophrenia é Y 9 27 - 75 
Psychoneurosis 3 31-71 




















Sub-Total | 5. 24 15 | 26-75 





Patients Receiving ECT 





Involutional f 10 
Manic Depressive 5 8 
Schizophrenia 5 ; 10 
Psychoneurosis 5 5.5 6 


Sub-Total | 5. 34.2 20 
Grand Total 5 58 : 5 31 





























*The per cent of each group which was clinically judged “improved’’ by the psychiatric staff at 
the time of discharge. 


MEASURES OF THERAPEUTIC SUCCESS 


MMPI. As has been indicated above, one of the procedures applied to checking 
the response to therapy was the MMPI. Each patient had provided a valid MMPI 
record nearly encompassing the period of hospitalization. The median number of 
hospital days not included within this interval is only 2.25; hence the two profiles 
may be designated appropriately as pre- and post-therapy measures. These paired 
profiles were categorized by clinicians. These judges, two psychiatrists and six 
clinical psychologists, were instructed to sort into three categories: (a) A group 
showing definite improvement; (b) A group showing lack of improvement; and (c) an 
uncertain group. 

Each judge sorted the 100 profiles three times so that the effect of knowing dis- 
charge diagnosis was examined, as well as intra-judge and inter-judge reliability. 
The knowledge of the discharge diagnosis had no noticeable effect upon the distri- 
bution of judgments and this variable was ignored in obtaining the composite re- 
sults. Averaging over all judges and sortings, only 13.7% of the judgments were 
‘‘uncertain.”’ The amount of agreement within judges (repeat-sort reliability) when 
the unimproved and uncertain categories were combined ranged from 76% to 98%. 
Agreement between ratings of judges for the “improved” category exceeded 90% for 
seven of the eight judges. 

On the basis of the initial MMPI sortings by the eight judges, each patient was 
given a final classification of “improved”, “unimproved”, or “uncertain.” The res- 
pective percentages were 61% “improved” and 39% “unimproved” or “uncertain.” 
Patients were classified in the ‘“‘uncertain’”’ category if fewer than five of the seven 
“reliable’”’ judges agreed in calling the case either “improved” or “unimproved”’.? 


Nurses’ Ratings. A second evaluative procedure involved application of a 
Nurses’ Rating Scale for Patient Behavior.’ The averages of the daily ratings of each 


*The algebraic sum of the differences between “intake’’ and “discharge’’ clinical scale T-scores 
dichotomized at 28.5 will duplicate this composite judgment, with 15% of the cases classified better 
than the clinical judgment and 2% classified worse than the clinical judgment of the profiles. 

’This observational schedule was devised by Schofield and placed in use on the psychiatric service 
in 1948. An extensive revision of this instrument suitable for a chronic psychiatric population has been 
reported by Lucero. ®) The scales provide daily recording, on a five-step basis, of Colevler of patients 
in fifteen different areas. 
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patient for each day of hospitalization were computed and these averages contri- 
buted the basic data for other statistical analyses used in appraising the behavior 
records. 

The period of hospitalization for each patient was dichotomized at the median 
day. For the first and second halves the averages and variances of the behavioral 


2 
ratings were calculated. The ratios, = and at were also obtained. 
1 


1 
The ratio of means would suggest improvement when greater than one. Similar- 
ly, less variability in behavior ratings in the second half of hospitalization would ap- 
pear to be an expression of improvement. Thus, the nurses’ ratings were reduced. to 
two estimates of condition upon discharge, each derived from a comparison of the 
first and second halves of the period of hospitalization. 


The distribution of = was found to be symmetrical about a mode at 1.00 


1 
2 


(median = 1.05) and markedly leptokurtic. The distribution of =) showed ex- 


1 
treme positive skewness with a range from .05 to 29.00 and a modal value in the 


interval 0.00 to 0.49 (median = .94). For approximately half of the patients there 
was a general tendency for the level of behavior ratings in the second half of hospital- 
ization to equal or be greater than the average level of the first half of hospitalization. 
The variability in the samples appears to decrease. 


Psychiatric Evaluation. The third set of data utilized in this study was provided 
by the statements concerning “condition on discharge’’ found in the formal dis- 
charge summary of each patient. Evaluation of the patient’s status at time of dis- 
charge is primarily the responsibility of a psychiatric resident whose formal, written 
summary of each case is checked and approved by a staff psychiatrist. Evaluation of 
the patient’s discharge condition is essentially a clinical judgment expressive of in- 
tuition, impression, and inference based upon casual observation, nurses’ oral reports 
and interview. To a small but rather consistent extent, the material in the nurses’ 
daily notes (N. B. not the Behavior Ratings) are utilized in this appraisal. To a 
greater extent, but less consistently resident to resident, information on the MMPI’s 
as provided by the psychology staff is applied in judging discharge status. There 
is, in other words, a degree of contamination of the clinical judgment by the data of 
both the MMPI and the behavior ratings; such contamination appears insufficient 
to invalidate the study of interrelationships among these variables. At no time did 
the clinical staff have available the graphs of the daily mean behaviorial ratings as a 
basis iad their evaluation. Their clinical evaluations called 91% of the cases im- 
proved. 


Rehospitalization. The final criteria are post-hospital follow-up data. This 
information was collected through the state Central Index which records all cases of 
hospitalization in the state institutions. With a median follow-up period of 5.8 
years, the base rate of re-hospitalization in the total sample was 31%. 


INTERRELATIONSHIPS OF THE CRITERIA 

The MMPI and Clinical Judgments. It may be noted that the clinical judg- 
ments of the psychiatric staff identified all cases as either improved or unimproved 
with no uncertain or indefinite cases. By contrast, the MMPI sortings were uncer- 
tain for one-fifth of the patients. While in the clinical evaluations 91% of the 
total sample were “improved’’, only 61% were so designated in terms of changes in 
their MMPI profiles. Of the 91 cases called improved by the psychiatric staff, 14 
were considered unimproved on the basis of their MMPI profiles. These 14 cases 
were examined with respect to their graphed, daily mean behavior ratings as re- 
corded by the nurses. The record of re-hospitalizations was also noted. 
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Only five of these disputed cases had > ratios in excess of 1.00 and the highest 


1 
of these was 1.18. The mean ratio for these 14 cases was .97. The mean value of 
2 
= was .90. The variance for the second half of hospitalization for 10 of the 14 
1 
cases was 70% greater than that in the first. Thus, by neither of these indices does 
this group show convincing evidence of improvement. In a sense, the nurses’ ob- 
servations of these patients recorded in an “he is” system were in accord with 
patients’ ‘‘I am” evaluation in the MMPI, and both of these sets of data indicated 
lack of improvement. Follow-up information of this sample of 14 disputed cases 
shows that 5 (36%) have been re-hospitalized. The base rate of re-hospitalization 
for the total is 31%. 


TABLE 2. JormntT DISTRIBUTION OF CLINICAL EVALUATION OF “CONDITION ON DISCHARGE”’ 
AND MMPI JupGments For 100 Casgs 








| ECT (N = 61) Non-ECT (N = 39) 


Unimproved & 
Uncertain 


Condition 





| Unimproved & 
Improved | Uncertain 


on 
Discharge 
42 | 15 17 | 1 91 
9 9 


Improved 





Improved 
9 


22 100 





Total 4 COS 17 17 








| 
| 
| 
| 
Unimproved | 2 2 0 





Table 2 presents a joint distribution of the two evaluations for the patients 
treated with ECT and those not receiving this treatment. Here the categories of 
“uncertain” and “unimproved”? MMPI’s have been combined. Table 3 presents the 


TABLE 3. Jomnt DISTRIBUTION OF CLINICAL EVALUATION OF ‘‘CONDITION ON DISCHARGE”? AND MMPI 
JUDGMENTS FOR 31 PatreENts WHo WERE REHOSPITALIZED 








MMPI Judgments 


ECT (N = 20) Non-ECT (N = 11) 





Condition 


on 
Discharge 


Improved 


Unimproved & 
Uncertain 


Sub- 
total 


Improved 


Unimproved & 
Uncertain 





Improved 
Unimproved 


10 
2 


6 
2 


~ 


16 
4 


3 
0 


6 
2 


9 
2 





25 


6 





Total 





12 


8 








20 





3 





8 





11 


31 





re-hospitalization frequencies for the cases in Table 2. Briefly we should note that 
the rate of re-hospitalization among cases judged improved on the basis of the 
MMPI is only 25%. Cases judged improved by the staff are re-hospitalized 27% of 
the time. Only 22% of cases judged improved jointly by the MMPI and staff were 
re-hospitalized. This is a small reduction in the base rate of hospitalization. By con- 
trast, there was a 55% rate of re-hospitalization in cases called “unimproved” or 
“uncertain”’ in both clinical and MMPI judgements. Concordance of MMPI and 
clinical evaluations occurred for only 66% of the cases and exceeded the accuracy of 
base rate prediction (69%) by only 6%. 

The clinical psychiatric evaluations of ‘improved”’ were correct for 95% of the 
69 patients who were not rehospitalized, but the clinical judgment of ‘‘unimproved”’ 
was assigned to only 19% of the rehospitalized group of 31 patients. The clinical 
appraisals thus had an overall “‘hit’’ rate of 72%, and a proportionately high rate of 
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false negatives (Table 3), since 25 of the 31 rehospitalized patients, or 80%, had been 
classed as “improved” by this criterion. 

The MMPI judgments of “improved” were correctly associated with only 66% 
of the non-rehospitalized cases, and those of “unimproved” with 50% of the re- 
hospitalized cases. The overall “hit’’ rate for the MMPI was 62%. In contrast to 
the clinical “condition on discharge’’ evaluations, the MMPI yielded a proportion- 
ately high rate of ‘false positives’, since 33% of the non-rehospitalized cases had 
been judged “unimproved” in terms of MMPI changes. 


Behavior Ratings. Neither of the behavior rating indices related significantly to 
re-hospitalization. Neither did the behavior ratings taken in combination with other 
variables relate significantly to re-hospitalization. Figure 1 reports graphically the 
daily mean behavior ratings for these patients in each of four diagnostic groups who 
received courses of ECT. The daily averages are reported for the five days preceding 
ECT, over the course of ECT, and for the five days immediately following the last 
treatment. The rank-order correlation between the four diagnostic samples with 
respect to their behavioral ratings before and after ECT is -.60. The treatment had 
far from uniform effect on the four groups and the amount of improvement afforded 
by ECT was inversely related to the relative status of each diagnostic group initially. 
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Maximum behavioral improvement is shown by the manic-depressive group which 
reveals a tendency toward further improvement after cessation of ECT. By con- 
trast, and surprisingly in view of the general impression provided by the literature, 
the sample of involutional patients showed marked variability in their behavioral 
level over the course of therapy and a post-therapy level somewhat inferior to their 
pre-therapy status. The nurses’ ratings appear to be most provocative when used in 
this type of comparison between different groups in treatment. 


SUMMARY AND CONCLUSIONS 


A sample of 100 diagnostically heterogeneous hospitalized psychiatric patients, 
61% of whom had received ECT, furnished the data for a study of the validity and 
inter-relationships of three expressions of patient response to treatment. The three 
measures were pre- and post-treatment MMPI profiles, daily nurses’ behavior rat- 
ings, and clinical evaluation of ‘‘condition on discharge.” The criterion against which 
each of these evaluations was checked was occurrence of rehospitalization. 

Thirty-one per cent (31%) of the total sample was rehospitalized. Using the 
factor of rehospitalization or non-rehospitalization as a criterion to validate im- 
mediate post-therapy judgments of patient response, the global clinical judgments 
had an overall ‘‘hit’’ rate of 72% and those based on MMPI changes were accurate 
for 62% of the cases. The clinical appraisals ran a very high rate of false negatives, 
calling 80% of the rehospitalized patients improved. Predicting rehospitalization or 
not in terms of concordance of the MMPI judgments and clinical evaluation per- 
mitted prediction for only two-thirds of the cases and the overall increase in percent 
of “hits” only slightly exceeded base rate prediction. 

Behavior ratings based on daily observations of the nursing staff were not 
reliably associated with the rehospitalized vs. non-rehospitalized criterion. The 
nurses’ ratings did demonstrate interesting variation in response of different diag- 
nostic groups to ECT. 

The degree to which the clinical evaluations exceeded the accuracy of the 
MMPI judgments is associated with their better reflection of the base rate char- 
acteristics of the criterion, 7.e., a better than two-to-one prevalence of “non-rehospi- 
talization.’”’ It must be noted that occurrence of rehospitalization is a somewhat 
“distant’”’ and gross measure of psychiatric status and that the greater prevalence of 
“unimproved” patients in terms of MMPI comparisons may validly reflect the pre- 
valance of mild psychiatric residuals following hospital treatment. 
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THE RELIABILITY OF CLINICAL JUDGMENTS OF 
ASOCIAL TENDENCY! 
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INTRODUCTION 


In view of the high reliability demonstrated when trained clinical psychologists 
were asked to rate on a seven-point scale how schizophrenic were a series of patient 
responses to vocabulary test items®?, the present authors were encouraged to in- 
vestigate what would happen to the reliability of clinical judgment if instead of the 
general dimension of schizophrenia, the judges were asked to make ratings on more 
specific dimensions representing individual aspects of schizophrenic thinking“. The 
dimensions used were potential intelligence, communicability, and concrete-abstract. 
While there was some diminution of reliability as the clinicians moved from the gen- 
eral to the more specific dimensions, reliability remained sufficiently high to indicate 
the promise of such scaling techniques in clinical practice. 

The present paper carries this study of clinical reliability into a different cate- 
gory of disorder employing a different dimension of judgment and another type of 
test item. Specifically, we were interested in the reliability of the scaling judgments 
obtained when clinicians were asked to rate the amount of asocial tendency exhibited 
in responses to Comprehension sub-test items from the Wechsler-Bellevue scale 
given to a group of Naval disciplinary cases. This choice was dictated by one auth- 
or’s previous experience who found during developmental work on the CVS abbre- 
viated intelligence scale that bizarre, asocial answers ‘requently were given by dis- 
ciplinary cases when tested with Comprehension items. As in our previous studies, 
ratings were also obtained from a comparison group of undergraduate student judges. 


PROCEDURE 

The clinical psychologists used were 15 professional clinicians scattered over the 
United States area. All had served in some of our previous experiments on the re- 
liability of clinical judgment and were familiar with the use of a seven-point scale in 
rating the schizophrenic nature of responses to Vocabulary test items. As a criter- 
ion for their original selection all possessed the Ph.D. degree and at least four years 
of full time clinical experience. The naive subjects were 15 undergraduates from a 
psychology course at Northwestern University. f 

The stimuli were answers to Comprehension test items from the Wechsler- 
Bellevue scale given to a group of Naval disciplinary cases from a detention barracks, 
some of whom had received psychiatric diagnoses as asocial psychopaths. From the 
total pool of responses available, the authors selected 50 items which they felt cover- 
ed the complete range of asocial tendency exhibited. Thus in response to the question, 
‘“‘Why should people pay taxes?’’, an answer of “to support the government” was not 
considered to exhibit any asocial tendency. On the other hand, answers of ‘“Any- 
thing you get you pay for’’, ‘“To keep politicians alive’, “Damned if I know”’, and 
“Why should we?” were considered to represent varying amounts of asocial tendency. 

The stimuli were mimeographed on sheets and distributed to the raters as in 
previous experiments @’ 4) with instructions as follows: 

“We are going to present you with a number of test responses to Comprehension items from 
the Wechsler-Bellevue scale. These responses were taken from a group of military disciplin 
cases that included both diagnosed and presumably some undiagnosed behavior disorders as on 
as psychiatrically normal individuals. No cases of mental deficiency, psychoneuroses, or psychoses 
are included. We are asking you to rate these responses for the amount of asocial tendency ex- 
hibited in the response. Please use a 7 point scale, with point ‘1’ representing no asocial tendency 
and point ‘7’ pape. an extreme amount. We are not interested in the ‘correctness’ or ‘in- 
correctness’ of the response, merely in any sign of asocial tendency that you may discover in it. 
1This study derives from a larger project subsidized by the Office of Naval Research under con- 

tract 7onr-450(11) with Northwestern University. 
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“On the data sheet we have indicated the particular item from the Comprehension test by a 
cue word following Wechsler’s practice. For your convenience we are repeating below the cue 
words, and the complete question to which each refers: 

ENVELOPE—What is the thing to do if you find an envelope in the street, that is sealed, and 

addressed and has a new stamp? 

THEATRE—What should you do if while sitting in the movies (theatre) you were the first 

person to discover a fire (or see smoke and fire)? 

Bap Company—Why should we keep away from bad company? 

TaxEs—Why should people pay taxes? 

Laws—Why are laws necessary? 

MarrIAGE—Why does the state require people to get a license in order to be married? 

“Tn summary, you are to concentrate on asociality as it characteristically appears in the 
asocial variety of psychopathic personality and to use a 7 point scale in which ‘1’ indicates minimal 
asociality and ‘7’ maximal.”’ 


We have defined reliability as meaning agreement of the individual judge with 
the group and measured it by means of a Pearson’s r comparing the individual values 
assigned to the stimuli by each judge with the mean values assigned by the total 
group, correcting for the influence of the individual judge upon the group. 


RESULTS 


The reliability coefficients obtained show excellent agreement among our trained 
clinical psychologists. They range from a high of .92 to a low of .64, with a median 
at .82. These are comparable with those found in our study of the schizophrenic 
nature of Vocabulary responses®?. A random splitting of the group into sub-groups 
of 7 and 8 judges each produced an r of .94. It is obvious that clinicians can make 
these ratings with good agreement among themselves. 

For the naive undergraduates the results are almost as good. The reliability co- 
efficients range from .88 to .51, with a median of .72. Splitting the judges into groups 
of 7 and 8 gives an r of .93. It is obvious that the undergraduates are agreeing 
among themselves. 

That they are also agreeing with the clinicians is revealed by comparing the 
mean values assigned to the stimuli by each group. The correlation here is .91. As 
in our previous findings, the difference between the mean values assigned the stim- 
uli were not significant. Contrary to our previous findings, however, the difference 
between the o of the means also was not significant, indicating less difference in 
the variability of judgment for the two groups. 


DIscussIoN 


Our results, then, indicate a sufficiently high reliability for clinical judges in 
this situation to indicate the potential value of such scaling techniques as a diag- 
nostic and general evaluative method. The high reliability shown by the under- 
graduates as well as their agreement with the clinicians opens the possibility of using 
lower level, less trained, ancillary personnel for some rough screening decisions. In 
this regard, however, it is well to remember that the authors have found in dealing 
with ratings for various aspects of schizophrenic thinking that when finer distinc- 
tions are called for, such as rating for “potential intelligence’’, “communicability”’, 
and “‘concrete-abstract”’, rather than a broad general dimension of deviation such as 
“schizophrenic’’, naive undergraduates were not able to keep up with the perform- 
ance of trained clinicians“. It is of interest to note that no responses were scaled at 
the upper extreme (point 7) by the judges. This may well be a function of the orig- 
inal selection of stimuli, but it duplicates the findings of Hunt and Arnhoff in 
scaling Comprehension items for schizophrenia. 

In conclusion, it may be of interest to give a few examples of the scaling. In 
answer to the question, “What is the thing to do if you find an envelope in the 
street, that is sealed and addressed, and has a new stamp?’’, the response ‘“‘Mail it” 
was rated 1.00 with o of 0.00; “Just walk by it’”’, 3.93 with o 1.18; “Look through it’’, 
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5.00 with o 1.21; and ‘Take the contents and destroy it’’, 6.07 with o 1.84. To the 
question “Why does the state require people to get a license in order to be married?”, 
an answer of ‘“‘That’s the rule’’ is scaled at 2.20 with o .74, and “To keep them from 
marrying more” at 3.66 with o .94. 


SUMMARY 


Trained clinicians and naive undergraduates were asked to rate on a seven-point 
scale the amount of asocial tendency present in responses to Comprehension test 
items obtained from a group of Naval disciplinary cases. The clinicians showed 
high inter-judge agreement, with the undergraduates performing almost as well. 
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CLINICAL JUDGMENT OF SOME ASPECTS OF 
SCHIZOPHRENIC THINKING! 


WILLIAM A, HUNT AND NELSON F., JONES 
Northwestern University 


As part of a comprehensive study of clinical judgment®: ®, the authors and 
their colleagues have investigated the reliability of judgment in a task involving 
rating on a seven-point scale the amount of schizophrenic thinking exhibited in 
patients’ responses to vocabulary test items“: * ¢ 5), Both trained clinicians and 
naive subjects were used as judges. Excellent reliability was obtained in this typical 
clinical task and ‘‘The naive and trained groups show good agreement in their evalu- 
ation of the stimuli, with the effect of clinical training showing significance only in 
the improved reliability or higher interjudge agreement manifested by the pro- 
fessional clinical psychologist’. 

The present study investigates whether or not this high reliability will continue 
to be found when specific dimensions or aspects of schizophrenic thinking, 7.e., the 
potential intelligence exhibited in the response, the communicability of the response, 
and its position on a scale of concreteness-abstraction, are substituted tor the general 
dimension of “how schizophrenic each of these responses is.’’ Moreover, as a more 
specific, supposedly more demanding task is substituted for the general one, will the 
gap in ability between the trained clinicians and the naive subjects increase, or will 
the performance of the naive subjects continue at its previous high level? Our specific 
hypotheses were: (a) That reliability of judgment would decrease as we move from 
the general dimension to the more specific, refined ones; and (b) that with the specific 
dimensions the difference in ability between trained and naive subjects would become 
more pronounced. 

In addition it was hoped that the intercorrelations between the various dimen- 
sions used might shed further understanding on the judges’ conceptions of schizo- 


roject subsidized by the Office of Naval Research under con- 
Jniversity. 
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phrenic thinking, since the dimensions used were selected to represent conventional 
approaches to schizophrenic thinking. Here we preferred to proceed in an explora- 
tory fashion without specific hypotheses. 


PROCEDURE 


The stimuli were 50 schizophrenic vocabulary responses used in our previous 
studies“), The task called for quantitative judgments on a seven-point scale on each 
of the three dimensions, potential intelligence, communicability, and concrete- 
abstract. The specific instructions for each dimension follow: 


Intelligence. “‘We are going to present you with a number of schizophrenic test responses to 
vocabulary items taken from the Wechsler-Bellevue and Terman intelligence scales. In previous 
studies you have rated these responses for the degree of schizophrenic involvement represented. 


This time we are interested not in the ‘schizophrenicity’ of the responses, but in the potential 
intelligence represented in the response. Please use a seven-point scale with point ‘1’ represent- 
ing mental deficiency and point ‘7’ representing very superior intelligence, with intermediate 
points representing an ascending order of intelligence. I have spoken of potential intelligence, 
rather than functional intelligence as we are not interested in the conventional ‘rightness’ or 
‘wrongness’ of the responses per se. The correctness or incorrectness of the responses can easily 
be obtained using the objective scoring norms provided for the tests from which they are drawn. 
We are interested rather in the signs of intelligence exhibited in the response, however uncon- 
ventional or incorrect it may be. In asking you to judge the potential intelligence exhibited in the 
response, we are in a way asking you to estimate the pre-illness intelligence of the subject making 
the response, the level at which he was capable of operating before schizophrenia entered to upset 
his intellectual functioning. Since these are all pe wae» shen responses, many of them will be 
confused, even incorrect in the light of conventional scoring techniques. Yet in the vocabulary 
level represented, in grammatical structure used, in the sweep and intricacy of the symbolism 
exhibited, some of these objectively ‘erroneous’ responses nevertheless will give clear evidence 
that the response could only have come from a subject originally of high intelligence. Some are 
obviously the product of less fortunate subjects. 

‘An illustration is in order. Let us use two examples: Pewter: Something that don’t smell 
so good. This is obviously incorrect. Yet above re | beyond this incorrectness, the quality of 
expression and the language used do not indicate a high level of intelligence. On the other hand: 
EsPIONAGE: a secret service, like the word of a man that has different aliases. This is a confused 
response, but in the level of the vocabulary used and in its tangential comprehension of the mean- 
ing of espionage it requires a higher level of original intelligence than did the former example. 

‘We realize that the nature of the stimulus words themselves, when they are at a low level 
of vocabulary, sometimes sets arbitrary limits to the amount of intellect that can be exhibited in 
the response. We realize that the confusion inherent in the disorder itself acts to conceal the in- 
telligence potential originally present. We realize that the cues upon which you will base your 
judgment are complicated and complex. Yet it is the sort of judgment you commonly make in 
clinical practice. We ask you to make it here.” 


Communicability. ‘‘We are going to present you with a number of schizophrenic test res- 
onses to vocabulary items taken from the Wechsler-Bellevue and Terman intelligence scales. 
= ig you have rated these responses for the degree of schizophrenic involvement repre- 

sented. 

“This time we are not interested in the ‘schizophrenicity’ of the responses, but in their com- 
municability, whether the response communicates a public, understandable meaning or whether 
it represents a purely personal, private meaning whose significance is apparent only to the patient 
himself. Please use a seven-point scale with point ‘1’ representing a ‘private’ response intelligible 
only to the subject, and point ‘7’ a ‘public’ response whose meaning is clear to anyone, with the 
intermediate points representing an ascending order of communicability. We are not interested 
in the schizophrenic pathology or in whether the response is correct or incorrect, but only in how 
clearly and unambiguously the subject has communicated what he considers to be the meaning 
of the word. 

“Let us illustrate: Fasie: the right thing when somebody else is equally right. Whatever the 
— meaning of this response is to the patient he obviously fails to communicate it to others. 

his response should be low on our scale. Fur: method of warmth of an animal. While this 
response is obviously schizophrenic, the subject is communicating clearly what the word fur means 
to him at the moment. The response is easily understood, and would be placed high on our high 
scale. 

“Your judgments then will cover a continuum ranging from completely private associations 
for which any public, consensual validation is lacking to public, unambiguous associations whose 
communication is clear and unmistakable.” 


Concrete-Abstract. ‘‘We are going to present you with a number of schizophrenic test res- 
ponses to vocabulary items taken from the Wechsler-Bellevue and Terman intelligence scales. 
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Previously you have rated these responses for the degree of schizophrenic involvement repre- 
sented 

“This time we are interested in your judgment of how ‘concrete’ the response is. We are 
assuming the classic concrete-to-abstract dimension on which thinking is often held to vary. An- 
other way of saying this might be to speak of the ‘inclusiveness’ of the concepts expressed in the 
response, whether the concept used has a highly specific object reference or whether it refers to a 
general class of objects. Please use a seven-point scale with point ‘1’ referring to a highly concrete 
response with specific, restricted reference and point ‘7’ referring to a hi hily general, abstract 
response whose reference is inclusive and which points to a broad class of m cues acs rather than 
a single concrete instance, with intermediate points representing an ascending order on the 
concrete-abstract dimension. 

‘As an illustration: Hero: we got the hero like some people in service in the front lines. Here 
the subject grasps the concept of ‘hero’ and is able to communicate it fairly well but does so 
through a rather limited, restricted reference to a specific example. This would seem to place 
somewhere on the lower portion of our scale. 

Bicyc.e: a riding transportation for people. 

onse relates the stimulus to the broad category of methods o' 
identified as one of a class of objects included in the abstraction, 
this response places high on our scale. 

“Your judgments then will place the response on a continuum ranging from highly restricted, 
non-inclusive conceptualization referring to a single specific instance or object, to highly perm- 
eable, minimally restrictive conceptualization which includes many relevant instances, objects, 
or qualities. In this sense, is the response concrete or abstract, is it specific or inclusive?” 


While the p amersen may not be apt, this res- 
human transportation. ‘Bicycle’ is 


‘transportation’. Obviously, 


Since the professional clinicians all previously had participated in a study in- 
volving rating ‘‘how schizophrenic each of these responses is’’“?, it was possible to 
compare their judgments on this general dimension with the three specific dimen- 
sions used in the present study. As the naive subjects had not performed in the pre- 
vious study, comparable data on the general dimension was obtained from some of 
them using the instructions from the earlier study. The instructions and stimuli 
were presented to the subjects on mimeographed sheets, as in the previous stud- 


ies: 4), 

The trained subjects were 31 professional clinicians from the previous studies. 
All had the Ph.D degree and a minimum of four years job experience at the time of 
their original selection. AJ] 31 judged on the intelligence dimension, 15 on the 
dimension of communicability, and 16 on the concrete-abstract dimension. The 
naive subjects were 90 undergradutes in psychology courses at Northwestern Uni- 
versity ; 30 judging on the general schizophrenic dimension, 30 on intelligence, and 
15 each on communicability and concrete-abstract. The students were randomly 
assigned to the differing conditions by randomizing the distribution of the rating 


sheets. 


RESULTS 


The data on reliability were obtained by comparing each subject’s judgments 
with the mean of the group, using Pearson’s r. In each case where groups of only 15 
were used a correction has been made to take account of the individual’s contribu- 
tion to the mean of the group. To conserve space we shall merely present the range 
and median reliability coefficient for each series of judgments. These are given in 
Table 1. Inspection of this table shows that for the clinicians, while reliability tends 


RELIABILITY COEFFICIENTS (EAcCHJUDGE WITH GROUP) FOR 
CLINICIANS AND UNDERGRADUATES 


TABLE 1. 








Factors 


High 


Clinicians 
Median 


Low 


High 


Naive 
Median 


Low 





Schizophrenia 
Intelligence 
Communicability 
Concrete-Abstract 





94 
.83 
91 
71 


.88 
.70 
.80 
.55 


63 
.22* 
.70 
.38 





.89 
.83 
85 
.73 


72 
61 
75 
-60 


— .50 
.14* 
.39 

— .54 





*With these two exceptions all r’s were significant at the .01 level. 
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to diminish on the more ‘specific’ dimensions, with the possible exception of the 
abstract-concrete scale, the reliability remains satisfactorily high and indicates the 
potential value of such scales in objectifying clinical judgment. The falling off in 
reliability is particularly noticeable in the increased range of reliabilities which in- 
dicates that some clinicians are finding the new tasks particularly difficult. An in- 
spection of the performance of individual subjects, however, shows that the drop in 
performance is not consistent over all the scales for any one individual. 

The same picture is presented by the undergraduates. The medians are some- 
what lower than those for the clinicians, and the range is greatly increased. The 
negative values for the lower ranges on ‘“‘schizophrenic”’ and “‘concrete-abstract’’ are 
deceptive as they are contributed by only one and two cases respectively. With the 
exception of concrete-abstract, none of the mean values assigned to the scales by the 
undergraduates are statistically significantly different from those of the clinicians, 
but the standard deviations are significantly larger for the undergraduates in all 
cases, as we have found previously). So far it looks as though the undergraduates, 
while not as reliable, are fairly close in performance to the clinicians. We then ran a 
rank order correlation between the mean values assigned each stimulus by both 
groups. For schizophrenia it was .95, for intelligence .83, for communicability .94, 
but for the concrete-abstract dimension it was negative, —.43. There is obvious dis- 
agreement on this last dimension. The picture of disagreement becomes sharper 
when we study the interrelationship depicted in Table 2. 


TaBLE 2. RANK ORDER CORRELATIONS BETWEEN SCALES 








Clinicians Undergraduates 





Schizophrenia & Intelligence —.1l — .52** 
Schizophrenia & Communicability — .90** — .84** 
Schizophrenia & Concrete-Abstract — .28* .66** 
Intelligence & Communicability .23 7" 
Intelligence & Concrete-Abstract .79** .58** 
Concrete-Abstract & Communicability .34* ie 





significant at .05 level 
significant at .01 level 


+e 

The interrelationships between the four scales were ascertained by applying 
Spearman’s rank order correlation using the mean group value assigned to each 
stimulus. These correlations are presented in Table 2. For the clinicians we find no 
relation between the “schizophrenic scale’’ and “‘intelligence’’, a significant minimal 
negative relation to “concrete-abstract”’, and a significant high negative relation 
with “communicability.’’ This last raises the question as to whether the basis for 
judgment on these two scales may not be identical. There is no relation between 
intelligence and communicability, and a significant high relation between intelligence 
and abstraction. The correlation between abstraction and communicability is sig- 
nificant but low. These results indicate that with the possible exception of the 
scales for schizophrenia and communicability, our clinicians are differentiating be- 
tween scales, indicating there are independent bases for judgment. 

In general our findings are consistent with the conventional picture of schizo- 
phrenic thinking, with the possible exception of the low negative relation between 
“concrete-abstract” and ‘“‘schizophrenic’’. In view of the historic claims for con- 
creteness to be an important aspect of schizophrenic thinking we might have ex- 
pected this minimally significant negative relation to be much higher. Our data re- 
semble the results of McGaughran and Moran? who found inability to communi- 
cate to be more closely related to schizophrenic thinking than a tendency either to 
concreteness or to abstraction. 

For the undergraduates the picture is quite different. They are in agreement 
with the clinicians only on the high negative relation between schizophrenia and 
communicability. Again this confirms the suspicion that these two scales may be 
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measuring the same aspect. All the other relationships differ markedly from those 
for the clinicians. It is obvious that while the undergraduates may agree among 
themselves they are not judging on the same bases as are the clinicians. It appears 
significant that for the undergraduates all the dimensions are related, either posi- 
tively or negatively, as shown by the high correlations. The undergraduates do not 
appear to be able to distinguish clearly between scales. The suspicion arises that 
when called on to differentiate between these various aspects of schizophrenic think- 
ing the group as a whole is unable to do so and tends to fall back on some common 
denominator as a basis for all their judgments. In this respect it is interesting to 
note that in an earlier paper © we found that undergraduates showed correlations of 
from .88 to .97 when asked to judge on the dimension of “peculiarity” as well as 
“schizophrenic’’. One suspects some common implicit scale such as peculiarity or 
bizarreness may be responsible for the high interrelationship shown by the under- 
graduates on the present scales. 


SUMMARY 


This study compares the reliability of trained clinicians with that of naive 
undergraduates when each group is asked to judge schizophrenic vocabulary res- 
ponses on the general dimension of ‘‘schizophrenic,” and on specific dimensions of 
“potential intelligence,” “ccommunicability,” and “concrete-abstract”. While the 
clinicians exhibit less reliability on the specific dimensions than on the general one, 
their performance remains sufficiently good to indicate the value of such scaling 
methods in clinical practice. When the undergraduates move from the general 
dimension to the more specific ones the evidence indicates that they are unable to 
differentiate clearly between the separate dimensions. The interrelationships be- 
tween thefour scales used are congruent with the picture of schizophrenic thinking. 
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JUDGING RORSCHACH RESPONSES: AN ILLUSTRATIVE PROTOCOL 
ROY M. HAMLIN AND WILLIAM T, POWERS 
Western Psychiatric Institute, University of Pittsburgh 


Introduction 


Can the clinical judge identify, at least in a preliminary way, the cues on which 
his judgments are based? Incidental attention has been given to this question in 
several previous studies by the authors and co-workers. For the most part, analysis 
of judges’ statements has not been published ®? or has been mentioned only incident- 
ally along with other results“: *’. Procedures for eliciting and classifying such state- 
ments by judges have not seemed satisfactory. 

This paper presents a single illustrative judge protocol, based on Rorschach 
responses. The material includes the subject responses on which the judgment was 
based, the judge’s running comment on the procedure she followed, the final judg- 
ment, and the like. The method used in obtaining the protocol is described, with a 
brief comment on the possible value of such protocols in research. 


METHOD 


A representative protocol was selected on an unbiased basis from 100 similar 
protocols, obtained as follows: Three experienced clinicians, each of whom had con- 
ducted and participated in somewhat similar previous studies, made 50 judgments 
each. They first made 25 judgments each where two units of behavior were pre- 
sented at a time—one unit the responses of a psychotic to a single Rorschach card, 
the other the responses of a non-psychotic. The responses included Inquiry, time 
notations, and minimal comment indicating that the S laughed, emphasized a word, 
looked at the back of a card, or the like. For each pair, the judge wrote from one to 
two pages of notes (roughly an average of 200 words), giving a verbal report of: (a) 
his reasoning as he proceeded; (b) the specific elements influencing his developing 
choice; (c) his choice; and (d) level of confidence. After each judgment, he was given 
the correct answer for the given pair. The three judges then proceeded with 25 more 
judgments, but in collaboration, again reporting their joint discussion, choice, and 
level of confidence. 

The subject responses on which the judgments were based were drawn from 
material described elsewhere“: 4», In the 100 judgments referred to here, half of the 
non-psychotic Ss were neurotic and half socially adjusted. The psychotic Ss were all 
diagnosed as schizophrenic. Mixed and questionable diagnoses were eliminated be- 
fore the Rorschachs were administered. The subject groups were comparable in 
age, sex, education, and intelligence. Ss with 1Q’s of less than 100, and Ss who gave 
fewer than 15 Rorschach responses were not included. 

The pair of subject responses presented here was selected at random from the 
pairs judged correctly by one of the three judges at the third, or middle level of con- 
fidence. There was no other basis of selection. This pair of responses was then pre- 
sented to one of the other judges, who had not previously seen this pair, and her 
running comment was taken down as she gave it orally, providing somewhat greater 
— than the running written comment. The oral comment is the one presented 

ere. 


RESULTS 


The judges made a combined total of 100 judgments. The number of correct 
and incorrect judgments at each level of confidence is indicated in Table 1. Con- 
fidence level #1 includes each choice, out of every consecutive five choices, which 
the judge indicated as the one in regard to which he was most confident. The results 
for these judges should not be considered representative for clinicians in general. 
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TaBLE 1. NwuMBER OF CORRECT AND INCORRECT JUDG- 
MENTS AT Eacu LEVEL OF CONFIDENCE 





Confidence No. correct No. wrong Total 








THE ProtTocou 


The two Ss responses to Card II, and the judge’s running comment follow. The 
jadge’s choice for this pair was rated at the third, or midddle level of confidence. 
Subject A: Card II 38” - 120” 
Oh Golly. 
38” 1. Look like two dogs (laughs). These 


should remind me of something, but 
I don’t know what (laughs). 


1. (Important) (Look like)... Oh. . 
take one? Got a little pug nose (laughs) and ears, 
and this looks like his paws . . . and he sorta has 
the coloring of a dog. (How much?) Upper half. 


2. (Look like) Don’t ask me what kind. (Look 


. Can I just 


2. Looks like two little bugs or some- 


thing . . . Are you supposed to des- 
cribe them? (Mean?) Supposed to go 
into detail? 

3. Looks like two birds there. 


like) Well. . . I don’t know, the stripes and little 
legs (parts?) faces and little legs (laughs). I don’t 
know what this is! It’s his hat. (Laughs). 

3. (Look like) They have a beak, eyes, and, uh, 


wings... Am doing it right? 
120” 


Subject B: Card IT 10” - 2'0” 

10” 1. Now this looks like 2 dogs, noses 
darn near rubbed up to each other. 
Dogs, or some kind of animals. Got 
their noses into something. (Else) I 
don’t know what they’d be eatin’. 
Probably feedin’ on some dog food or 
something. 

Looks like the dogs, 2 hind feet up 

and their heads down. 


1. (Feedin’) Possibly, or maybe drinking. 


Same response. (Card inverted.) 
2’0” 


Judge’s comment: In this illustration, the judge proceeds systematically through 
the initial responses of Subject A, then through the Inquiry, and finally through the 
initial responses of Subject B: 


Subject A: From the first few words, I would say this is a girl, possibly between 18 and 25 
years old. More “hysteric’”’ that is, non-psychotic. She says, “Oh, Eoiy! ook like two dogs’, 
then laughs after this very common response. She’s uneasy. Probably the examiner was a man. 
be pe she freezes: ‘‘should remind me of something’’ and ‘“‘don’t know’’, then laughs again: sort 
of coy. 

“Two little bugs or something”, then ‘‘are you supposed to’’. She is leaning dependently, 
wants the examiner to participate in the game more directed toward establishing interaction 
with the examiner; not psychotic uncertainty. Then, ‘‘supposed to go into detail’. She’s stalling 
too, of course. ‘“Two birds’’; content (is not very adult): dogs, bugs, birds. 

Again here: “Oh... Can I just take one?’’. She’s being playful, provocative, teasing. “Little 
pug nose’: she’s putting on a little girl act. If this is a male subject, the handling would be bad. 

ut the content and verbalizations are congruent: I stick to the assumption she’s a female. 


This, “he sorta has the coloring of a dog’’: she’s relaxed enough, in — of her questions and 


bids for sup ort. She is comfortable enough. ‘‘Don’t ask me!’’ — the helpless little girl role. And 
“stripes’’, Tittle legs’’: — emphasize female, immature element; flirtatious, a bit. “I don’t know 
what this is’’, then with delight: ‘‘his hat!’’. She’s playing dumb, making a game of it, giving the 
examiner a little difficulty. She probably thought of these things she ‘discovers’ right from the 
start. Then this “beak, eyes, wings’’: hysteric, dependent element. A sort of Southern Belle role: 
“Am I doing it right?’”’. This question isn’t psychotic uncertainty. It doesn’t have the psychotic 
‘I can’t do it’, or the flat ‘it is’ quality. 
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Subject B: First thing: “looks like two dogs’’, “darn near rubbed up to each other.’’ The second 
phrase is very peculiar, as though there were something strange in seeing dogs rubbing noses on 
this card. Most subjec ts would just say ‘two dogs with their noses together’ . And the “or some 
kind of animals’’ suggests vague perceptions. A non-psychotic might say ‘or they could be bears’. 
Then, ‘‘got their noses into something’’. The “something” indicates haziness or uncertainty about 
his own abllity to perceive. “I don’t know what they’d be eatin’: a gap in communication: he 
hasn’t said they were eating; the psychotic’s way of talking as though he was already communi- 
cated something, or as though the examiner already knows something. Then “‘feedin’ on some dog 
food’’ is very concrete: dogs are eating; eating what? dog food. The ‘‘or something”’ is very vague 
and indefinite. (In contrast) the ‘dogs eat dog food’ is too concrete. 


I’m probably wrong on this pair. Both subjects see dogs, both start with a popular response. 
If I’d looked at the second subject’s responses first, I might not be so sure. But I just can’t see the 
first subject as psychotic. 


DISCUSSION 


In this example, the judge selects the non-psychotic tentatively, but correctly, 
on the basis of the first few words. This rapid, often correct, “hunch” was character- 
istic in the majority of the 100 choices. The judge then builds up several general 
inferences: (a) the non-psychotic is reacting to the examiner with affect, and with 
patterned defenses; and her expressions of uncertainty are related to this pattern; 
(b) the psychotic is uncertain of his perceptions; (c) the psychotic becomes ‘‘typi- 
cally” both vague and concrete in groping for details to elaborate his responses; and 
(d) the psychotic cannot keep track of what he has communicated to the examiner. 

The illustration suggests that an experienced clinician may draw on a wide 
variety of cues, many of which show little relation to traditional Rorschach scoring 
categories. Long acquaintance with the Rorschach may guide or check this judge in 
emphasizing cues, but her actual comments draw chiefly on ‘‘expressive”’ behavior. 
The judgment does not depend on an interpretation of the projection of unconscious 
factors, nor on classical indicators of personality structure. It is hardly necessary 
to point out that the task did not permit overall Rorschach interpretation. 

A wide variety of cues may contribute to a clinical judgment, and these cues 
may not be identical from case to case, or even from judge to judge. Judges who are 
receiving constant correction of errors after each choice may become particularly 
competent in formulating precisely what cues they use and how they discriminate 
these from other behaviors which are superficially similar. Careful analysis of 
judge protocols may help identify cues which can then be reduced to reliable scoring 
and psychometric analysis. Other features of the judgment process may not permit 
traditional psychometric parallels, but can encourage the development of new de- 
signs for research analysis. 


SUMMARY 


The paper presents an illustrative judge protocol: the running comment of a 
judge on her procedure as she makes a clinical judgment. The method and instruc- 
tions followed in obtaining the protocol are described, with emphasis on the cor- 
rection of errors after each choice. The research value of such protocols, in studying 
the process and optimum use of clinical judgments, is suggested. 
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Increasingly, systematic attention is being directed to the manner in which the 
psychologist communicates his clinical findings“ ™: % 1 17, The concern of this 
paper centers about an aspect ot this problem—the failure of some psychologists to 
individualize the client in their clinical reports, that is, to describe the client so that 
the reader knows how he differs from other people and in what important ways he is 
similar to others. Davenport, for example, found that psychologists had difficulty 
in selecting TAT interpretive statements, taken from case reports on a sample of 
men, which could be used to discriminate among patients. Many of the statements 
could be applied to any adult male. 

This peculiarity of interpretive-type comments may readily be put to nefarious 
use as noted by Paterson®: ». 47) who presented a ‘‘Character Reading at Sight of 
Mr. X. According to the System of Mr. P. T. Barnum.” The essence of the Barnum 
method is to describe a personality by using a few mildly negative generalities which 
are quickly neutralized in a matrix of acceptable, even flattering remarks, both 
types of comments being apparently applicable to almost everybody. This sort of 
report is thus likely to be readily ‘‘validated’”’ by the subject about whom the gener- 
alized description allegedly is written. People are gullible this way“ ™, hence 
Meehl has referred to such description as the ‘‘Barnum Effect.’’ The phenomenon is 
also demonstrated by the other arts in the same category, such as palmistry, graph- 
ology, tea leaf reading, and astrology’; and the revelations in the thirty-five cent 
horoscope stemming from the latter discipline have been compared with certain 
classes of interpretive remarks which may be derived from the Rorschach“. Not 
to be outdone by some of the serious students of projective tests who use their tech- 
niques to gain Barnum-type material, Holt “*? has published a valid-sounding TAT 
interpretation of a patient prior to seeing patient or protocol. Similarly, Dr. Norman 
Sundberg prepared an unpublished clinical report which he labeled ‘‘Completely 
Blind Analysis of the Case of a Schizophrenic Veteran.”’ It was “written before 
knowing anything about the patient except that he was a new admission to the Rose- 
burg V(eterans) A(dministration) H(ospital) and was to be worked up”’ for a sched- 
uled staff meeting. 

Quite similar to the Barnum phenomenon is what might be called the Aunt 
Fanny description in clinical reports. Superfluous statements, such as, “This client 
has difficulty in performing at optimal capacity when under stress,’’ or “‘The client 
has unresolved dependency feelings,’”’ or ‘“The client has unconscious hostile urges’’ 
might readily prompt the report reader to think “So has my Aunt Fanny!” The 
nondifferentiating Aunt Fanny description may be complimentary to the client, un- 
complimentary, or neither, and it also differs from the Barnum method in that it is 
not used with the intention of satisfying a client. 

Another commonly noticed offense against individualized reporting is seen in the 
trade-marked report. This name is proposed for reports which may communicate the 
psychologist’s personal concerns, interests, conflicts, shortcomings, or dynamics 
rather than those of the client. Where the psychologist reports in this manner his 
clients tend to appear to have gross similarities of personality make-up. The exist- 
ence of this phenomenon has been shown by several workers, ®: 5 8: 10, 1) 

Still another type of nonindividualized report is the prosecuting-attorney brief— 
virtually the reverse side of Barnum’s coin. After listening to such a report at a staff 
meeting, the listener might be tempted to call out “Now will the attorney for the 
defense please come forward!” Such reports are saturated with the negative dyn- 
amics the psychologist reads about while in graduate school, but give little or no 
attention to positive features, to commendable conscious strivings, socially valuable 
compensations, and other well-used defenses. These reports consistently reflect the 
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motto “Always interpret at the lowest possible level of psychosexual fixation or re- 
gression.’’ They are prepared by psychological simians who hear no good, see no 
good, and report no good. 

Honesty and sober reflection certainly are necessary if Barnum’s method is to 
be eliminated from psychology. Such an attitude should also tend to keep descrip- 
tions of Aunt Fanny to a minimum. A little more insight into the self, and perhaps 
greater interpretive acumen, would be the prescription for the psychologist who 
trade-marks his reports or writes prosecuting-attorney briefs. 

There are several considerations which apply to individualizing a client in a 
psychological report. These may be used as guides for the psychologist to the extent 
his data permit. 1. Consider how important it is to mention a finding which is true 
of many persons. About one-half of the population is of average intelligence, yet it 
may be important to mention in a report that ‘‘average intelligence’”’ is descriptive 
of the client under consideration. 2. Show the unique aspects of a trait. It is mean- 
ingless to just indicate, for example, a trait of dependency. We all have it. What is 
important is how the trait is manifested. 3. The significance of a trait in an individual 
depends partly on its strength. Adjectives indicating strength should be used liber- 
ally. 4. The level(s) at which behavior occurs should be made explicit. ‘‘Hostile’’ is 
a word which has different meanings as the significant dynamic in a social reformer, 
the smarting innuendo of the society matron, and the outbursts of the explosive 
psychopath. 5. The circumstances under which behavior becomes overt, or in some 
way significant, should be clear to the reader. Divorcing a report from the life situa- 
tion is an effective way of keeping it general. 6. The manner in which traits interre- 
late should be specified. Relating hostility to unresolved dependency may be more 
meaningful than to simply indicate hostility and dependency in various parts of the 
report. 7. Avoid a “shotgun effect”’, i.e., including every possible finding and thus 
precluding the possibility of having omitted something. Always ask ‘“‘Will this state- 
ment help my reader to understand the client as an individual?” 


SUMMARY 


The need for clinical psychological reports to more effectively portray the in- 
dividual characteristics of the client was examined. Four types of report writing 
which defeat this purpose were discussed and remedial measures were suggested. 


REFERENCES 


1. Davenport, Beverty F. The semantic validity of TAT interpretation. J. consult. Psychol., 
1952, 16, 171-175. 
2. Duwnnerre, M. P. Use of the sugar pill by industrial psychologists. Amer. Psychologist, 1957, 12, 
223-225. 
3. Fiier, R. N. The clinician’s personality and his case reports. Amer. Psychologist, 1952, 7, 336. 
4. Forer, B. R. The fallacy pt premiere validation: a classroom demonstration of gullibility. J. 
abnorm. soc. Psychol., 1949, 44, 118-123. 
5. Hammer, E., and Prorrowskt, Z. A. Hostility as a factor in the clinician’s personality as it 
affects his interpretation of projective drawings (H-T-P). J. proj. Tech., 1953, 17, 210-216. 
6. Hammonp, hk. R. and ALLEN, J. M. Writing clinical reports. New York: Prentice-Hall, 1953. 
7. Kewiey, D. M. Clinical reality and projective technique. Amer. J. Psychiat., 1951, 107, 753-757. 
8. Marcuss, F. L. Projection, 1953. Amer. Psychologist, 1955, 10, 43. 
9. Buu, M. L. and Batinsky, B. Counseling and psychology. New York: Prentice-Hall, 1951. 
10. Rosrnson, J. T. Some indications of personality differences among clinical psychologists as 
revealed in their reports on patients. Unpublished M.A. thesis, Duke University, 1951. 
11. Rosrtnson, J. T. and Conen, L.D. Individual bias in psychological reports. J. clin. Psychol., 
1954, 10, 333-336. 
12. SHNEIDMAN, E.S., Bett, J. E., Fine, R., Hout, R. R., Kurasn, 8. B., and Vornaus, PAULINE G. 
The case of Jay: interpretations and discussion. J. proj. Tech., 1952, 16, 444-475. 
13. SunppereG, N. D. The acceptability of “fake” versus “bona fide’’ personality test interpreta- 
tions. J. abnorm. soc. Psychol., 1955, 50, 145-147. 
14. Tatient, N. An approach to the improvement of clinical psychological reports. J. clin. Psychol., 
1956, 12, 103-109. 
15. Taytor, Janet L. and Tercuer, A. A clinical approach to reporting psychological test data. 
J. clin. Psychol., 1946, 2, 323-332. 
16. THorne, F.C. A new outline for psychological report writing. J. clin. Psychol., 1956, 12, 115-122. 
17. Thorne, F. C. Principles of psychological eramining. Brandon, Vt.: Journal of Clinical Psychol- 
ogy, 1955. 
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In the spring of 1956 a survey was conducted in 63 Veterans Administration 
mental hygiene clinics to obtain information concerning the judged effects and 
benefits of outpatient treatment for psychiatric and neurologic disabilities. More 
specifically, this study sought to secure four types of data: (a) the frequency of treat- 
ment interviews for different types of psychiatric patients; (b) the length in minutes 
of the usual interview; (c) the extent to which outpatient treatment averts hospital- 
ization; and (d) the extent to which different types of patients are judged to benefit 
from treatment. 


THE SAMPLE AND THE CLINICS SURVEYED 


To secure a representative sample of the approximately 20,000 cases seen every 
month in VA mental hygiene clinics, each clinic prepared reports on two types of 
cases. Reports were made on all “closed”’ cases terminated for any reason during 
March 1956 which had three or more visits for psychotherapy since initiation of the 
current period of treatment. The “open” cases represented a random sample of all 
cases in the open files of the clinic, as of the close of the month of March 1956, which 
had three or more visits for psychotherapy since initiation of the current period of 
treatment. Open cases were defined as those which were active or inactive but not 
terminated in March. A random sample of open cases was obtained by examining 
the first case, and each succeeding third case. If the criterion established was met 
the case was selected. If the criterion was not met, the cases immediately following 
the rejected case were examined until a case meeting the requirement was selected. 
The aad sample concerns 5,367 cases of which 4,892 are ‘‘open’”’ and 475 are 
““closed”’. 

The 63 clinics included in the study are widely distributed geographically over 
the entire United States. They differ with respect to type and severity of illness of 
patients seen, size of case load, size of staff, and as to examination and treatment pro- 
cedure. Size of staff seemed to be an especially important factor since the case load 
size, the pattern of examination, staffing and treatment are directly affected. Ac- 
cordingly each clinic was categorized as ‘‘small,’”’ “medium,” or “large”. Twenty- 
one clinics with 3 or fewer professional personnel were designated small; 25 clinics 
with 4 to 9 professional people were labelled medium; 17 clinics with 10 or more 
professional staff were coded large. 


Tue Case REcorRD 


A case record form was sent to each of the clinics. An individual Case Record 
was filled in for each patient included in the survey including information on the 
following items: 

The primary clinic diagnosis. 

The number of weeks of treatment received during the current period of 
treatment up to March 31, 1956, or date of discharge. 

The number of treatment interviews: (a) for the current period, (b) for prior 
periods in the clinic, and (c) for prior periods in other clinics. 

The length (in minutes) of the usual treatment interview during the current 
treatment period. 

The scheduled frequency of treatment interviews. 

A judgment as to whether treatment averted hospitalization. 

Judgments in nine areas of maladjustment of degree of change in the patient 
since coming to the clinic. 

A judgment concerning the degree of change in over-all adjustment of the 
patient since coming to the clinic. 

9. The discipline of the reporting therapist. 
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Data for the clinic diagnosis and for all judgments were furnished solely by the 
therapist currently on the case who also reviewed the entire report. Typically, judg- 
ments of improvement are vague, global and nonspecific in nature. The therapist is 
asked to describe the patient as worse, unchanged, slightly improved, moderately 
improved, or extensively improved. As a consequence of this lack of reference, inter- 
rater agreement is low, and the findings are lacking in comparability. There is, for 
example, no way of checking whether the judgments made in one clinic are com- 
— as to content to those recorded in another clinic, or in another survey or 
study. 

In order to specify more concretely what was to be judged, nine scales were con- 
structed. Each represented a group of common problems and complaints found 
among psychiatric outpatients such as tension-anxiety, sense of personal inadequacy, 
interpersonal relationships and emotional control. The clusters chosen were taken 
from those repeatedly identified in the development of the MSRPP“?., The therapist 
was asked to consider each cluster of complaints as a totality and to indicate the 
degree of change observed in the patient since coming to the clinic. In the event that 
no judgment was possible because the patient did not complain of, or exhibit, this 
particular type of disturbance, a ‘does not apply”’ category was checked. 

In addition the therapist recorded a judgment descriptive of the patient’s over- 
all change since coming to the clinic. Such over-all judgments have been found to be 
useful when they are preceded by judgments concerning more specific details. They 
permit the rater to make an integrated judgment based on all information available 
to him. 


THE FINDINGS 
Number of Treatments and Interview Length. The number of weeks each patient was 
listed as being in treatment during his current period of treatment was categorized 
in two ways. First the clinics were grouped with respect to size. Then the cases were 
categorized as open or closed. As might be expected the range is wide. For open 
cases the averages for clinics run from 25 weeks to 170 weeks. 
TaBLe 1. AVERAGE NUMBER OF WEEKS OF TREATMENT DuRING CURRENT PERIOD; AVERAGE NUMBER 


or TREATMENT INTERVIEWS; AND AVERAGE LENGTH (IN MINUTES) OF TREATMENT 
INTERVIEW BY S1zE OF CLINIC FOR OPEN AND CLOSED CASES 























Average number 
Average number treatment Average interview 

Size of Number of weeks treatment interviews length 
clinic cases current period received (in minutes) 

open closed open closed open closed open closed 
Small 724 108 | 79.0 44.8 34.9 17.7 48.7 43.9 
Medium 1583 161 103.0 48.5 36.8 23 .0 47.9 51.5 
Large 2585 206 | 105.0 57.7 50.9 29.3 44.1 48.9 

| g 





Table 1 shows that for both open and closed cases the number of weeks a patient 
is seen in treatment is a function of the size of the clinic. The larger the clinic the 
longer the total period of treatment. Translated into years, it can be said that the 
typical open case in the small clinic is seen for 114 years, and for about 2 years in the 
medium or large clinic. For the closed case the typical period of treatment is from 9 
months to a year. It should be noted here that these values refer to cases that stay 
in treatment beyond three visits. It is known that between 30% to 50% of all cases 
admitted for treatment terminate within the first three treatments. 

The number of weeks a patient is listed as being on the rolls in treatment can be 
very deceptive. Patients may actually keep only a fraction of their treatment ap- 
pointments. Cases are kept open for undetermined periods for vacations, temporary 
terminations, and a variety of other reasons. The number of treatment interviews a 
patient receives is a more accurate picture of treatment administered by the clinic. 
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An examination of Table 1 shows that the number of treatment interviews re- 
ceived by a patient increases with the size of the clinic. The typical open case in the 
small clinic is seen for 35 treatments. In the large clinic this rises to 51 treatments. 
The typical closed case is seen about half as frequently. Again it should be noted that 
the range within any category is considerable. It runs from an average of 13 treat- 
ment interviews in one clinic to an average of 84 treatment interviews in another. 

In order to compare clinics differing in size, the average length in minutes of the 
treatment interview was computed separately for each clinic. The range for all 
clinics is from 30 minutes to 68 minutes for an interview. As can be seen from Table 
1, in small and medium sized clinics the average interview for open cases runs 48 or 
49 minutes. In the large clinic the average time drops to 44 minutes for such cases. The 
picture is not as clear for cases closed during the survey month. Perhaps this is be- 
cause a smaller number of cases is involved or because the reasons for closing are 
much different in the various clinics. The typical interview of the closed case in the 
small clinic runs about 44 minutes, and about 50 minutes in medium and large 
clinics. 


Judged Patient Change. The average degree of change in a patient was computed by 
dividing the sum of his rating on the types of disturbances or complaints by the 
number of such ratings. This procedure compensates for omitted ratings where the 
therapist found himself unable to record a judgment on a particular scale. Weights 
from 1 to 4 respectively were assigned to judgments of change. Thus an average 
score of 1 means that the patient is worse; 2 means that no change occurred; 3 means 
some improvement; 4 means considerable improvement. 


Tas_eE 2. THe AVERAGE DEGREE OF RatTep CHANGE IN PATIENTS 











Open cases Closed cases 
Clinic size Average No. of Average No. of 
change cases change cases 
Small | 2.78 724 2.65 108 
Medium | 2.85 1583 2.62 161 
Large | 2.80 2585 2.64 206 
| 








The range of change scores reported for open cases shown in Table 2 does not 
vary with size of clinic. The ratings extend from 2.5 to 3.0. There is also negligible 
variation in mean clinic ratings with respect to size of clinic for the open cases. The 
typical open case is rated about 2.8, which indicates moderate improvement. In- 
spection of the means for the closed cases reveals a wide range of ratings extending 
downward. This suggests that a high proportion of closed cases represent patients 
who have shown only slight or no improvement as judged by these scales. The aver- 
age closed case is rated 2.6 as compared to 2.8 for open cases—a small but consistent 
difference. 

A rating for each patient was also obtained on a five-category scale of over-all 
change. The current therapist was instructed to consider any factors regarded as im- 
portant to the patient’s total personality adjustment, to evaluate them, and then to 
rate the patient as to the degree of change observed since the latter began coming to 
the clinic. 

Let us first consider over-all change when open cases are classified as to type of 
psychiatric disorder. Some of the findings are presented in Table 3. The range of 
means is narrow. It is evident that the typical patient in all diagnostic classifications 
is rated as showing slight or moderate improvement. Statistical study of the differ- 
ences indicates that the involutional and affective reactions, the psychoneuroses and 
the psychophysiological disorders do not differ from each other. However, they are 
significantly more improved (.01 level) than the schizophrenic reactions and the 
personality disorders. This is certainly congruent with clinical judgment. 
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Another approach is to determine whether any differences exist between these 
diagnostic groups with respect to the numbers showing any positive change. The 
range is from 75 per cent for personality disorders to 85 per cent for the psycho- 
neurotic disorders. Actually, however, only the percentage of schizophrenic re- 
actions and personality disorders is significantly less than the other diagnostic groups. 


TABLE 3. AVERAGE OVER-ALL DEGREE OF CHANGE OF PATIENTS 
BY D1aGnostic Groups 





Percent showing Percent showing 
Mean an moderate or 


y 
Type of disorder change positive change extensive change 





Psychoneurotic 3.37 of 43.7 
Psychophysiologic 3.30 , 41.2 
Psychotic - Affective 3.37 : 47.6 
Schizophrenic & Paranoid 3.13 r 31.7 
Personality 3.14 j 34.8 





Size of Clinic. The over-all change ratings were examined for differences with respect 
to size of clinic. The mean over-all change was rated 3.24, 3.34, and 3.25 for small, 
medium, and large clinics respectively. Evidently size of clinic plays little role in the 
amount of improvement reported. 

It seemed also worthwhile to explore the relation between the over-all change 
rating received by a patient and the average change rating based on nine areas of 
illness. Are these judgments essentially the same or do they reflect novel elements in 
the over-all judgment? Using a one-third sample of the reporting clinics, tetrachoric 
correlation coefficients were computed between these two ratings. For small clinics 
the correlation was .96, for medium sized clinics .90, and for large clinics .90. These 
results suggest that there is substantial agreement, regardless of clinic size, between 
over-all and average change ratings. 

Finally the diagnostic groups were compared with respect to per cent of patients 
showing moderate or extensive improvement. The findings again separated at signi- 
ficant levels the schizophrenics and personality disorders from all other groups but 
not from each other. 


Scheduled Frequency of Treatment. The scheduled frequency of treatment for open 
cases and by size of clinic is presented in Table 4. This item is of obvious interest as 
reflecting current treatment practice. Inspection of Table 4 reveals that the per- 
centage of patients seen once a week increases steadily with size of clinic. On the 
other hand, the per cent of patients seen twice a month decreases consistently with 
size of clinic. 

It seemed also of interest to compare clinics with respect to the total open cases 
seen once a week or more. Typically such patients are regarded as receiving a more 
intensive form of treatment. The percentage rises with size of clinic from 34% to 
38% to 48% of patients. The differences are significant (P < .001) except for the 
difference between small and medium sized clinics. It should also be noted that at 
least one-third of the clinic load is seen only once a month or less. The data for 


TaBLe 4. ScHEDULED FREQUENCY OF TREATMENT FOR OPEN CASES 





Small clinics Medium clinics Large clinics 


Scheduled No. of % of No. of No. of % of 
frequency cases cases cases cases cases 





More than once 

a week 37 ; ; 191 
Once a week 212 . ‘ 1057 
3 times a month 33 : 3 72 
2 times a month 169 ; . 372 
Once a month 

or less 273 : : 893 
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closed cases indicate again that the larger the clinic the more frequently patients are 
scheduled for treatment once a week or more. The percentages climb from 40% for 
the small clinic to 64% for the large clinic. 

Do patients scheduled more frequently tend to belong to particular diagnostic 
groups? As indicated previously it was found that schizophrenic and personality 
disorders were rated as showing less change than the remaining diagnostic groups. 
Using this two-fold breakdown, diagnosis was correlated with scheduled treatment 
frequency which was also dichotomized near the median. Chi-square tests showed 
that regardless of clinic size there is no relationship present. 


Effect of Treatment on Hospitalization. Each therapist was asked to indicate whether 
in his judgment treatment averted hospitalization. He could answer ‘“‘yes,”’ ‘“‘no,”’ 
or “not yet determined.’’ Unfortunately the question was ambiguous since a ‘‘no”’ 
answer could mean (a) that the patient was hospitalized because treatment failed to 
avert it, or (b) that the problem of hospitalization was of no importance in the case. 
On the other hand a “‘yes’’ response does indicate that treatment was considered to 
have averted hospitalization. 

The percentage of ‘‘yes’’ answers of all replies made was separately computed 
for each clinic. Clinic variation in response was marked. The percentage ran as low 
as 16 and as high as 100. In about 54% of open cases the therapist indicated that 
treatment may have averted hospitalization. Clinic size appears to have little effect 
on this percentage. As might be expected, in only 34% of closed cases does the 
therapist report that treatment averted hospitalization. This per cent is consistently 
lower than for open cases regardless of size of clinic. 

A related question is the type of case for which clinic treatment is helpful in 

averting hospitalization. There is general agreement that treatment is most effective 
in averting hospitalization for schizophrenics (64%). Then in rank order follow the 
affective psychoses (63%), the psychoneuroses (53%), the personality disorders 
(53%), and finally the psychophysiological cases (50%). Only the smaller clinics, 
perhaps because few cases are involved, differ as to the rank order of the last three 
diagnostic groups. 
The Therapist and his Case Load. Each therapist completing a Mental Hygiene 
Clinic Case Record indicated his profession and whether or not he was a member of 
the staff. Of all open cases 92% are carried by staff personnel. Likewise, of all closed 
cases 91% are carried by the staff while the remainder are treated by psychiatry 
residents, psychology trainees, and social work trainees. However, while the psy- 
chiatrist carries 57% of all open cases he carries only 45% of the cases closed. The 
psychologist and social worker thus carry proportionately larger numbers of closed 
cases. 

For open cases the percentage of cases carried by psychiatrists steadily increases 
with the size of clinic from 40% to 63%. Correspondingly the percentage carried by 
the psychologist decreases with size of clinic from 32% to 17%. The percentage car- 
ried by social workers’ decreases from 20% to 11%. The picture for closed cases is 
somewhat different. The per cent carried by the psychiatrist steadily decreases from 
small clinics to the large clinics. The social worker and psychologist load is variable. 

What types of patients do the various disciplines carry as their caseload? An 
analysis was made of the medium sized clinic case load. The differences are not very 
striking. Uniformly across disciplines approximately 80% of the case load are schizo- 
phrenics and psychoneurotics. However, the social worker carries proportionately 
more schizophrenics while the psychologist carries proportionately more psycho- 
neurotics. 


Number of Treatments and Improvement. Are patients who have been in treatment 
for some time rated as showing more improvement than those who have been in 
treatment only briefly? Both for open and closed cases the mean level of improve- 
ment rises sharply initially and then levels off by the end of 45 treatments. There 
appears to be a break in the curve and the mean rating drops between the 55th and 
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65th interview. It may be that after about one year certain of the more improved 
patients drop out and hence the mean rating goes down. The data thus suggest that 
rated improvement increases with number of treatment interviews received. The 
correlation ratio between these two variables for large clinics is .31, and .23 for med- 
ium size clinics. Both correlations are significant at the 1% level. 


SUMMARY 


In the spring of 1956 a survey was conducted in 63 VA mental hygiene clinics. 
Case record reports were obtained on 475 cases closed during the month of March 
and on a sample of 4,892 open cases. All cases had been seen in at least three psycho- 
therapeutic interviews since coming to the clinic. Data for all judgments were furn- 
ished solely by the current therapist. The finding may be summarized as follows: 

1. The number of weeks a patient is seen in treatment is proportional to the 
size of the clinic. The larger the clinic the longer the treatment. The typical open 
case is seen in treatment from 114 to 2 years. The typical closed case has been seen 
for 9 months to a year. 

2. The number of treatment interviews received by a patient increases on the 
average with size of clinic from 35 to 51 treatments for open cases. The number of 
interviews received by closed cases also varies with size of clinic but is roughly half as 
frequent as for open cases. 

3. The average length in minutes of the treatment interview decreases with 
size of clinic for open cases. The average interview runs for 49 minutes in the small 
clinic and for 44 minutes in the large clinic. The treatment interview of the closed 
case is of similar length but shows no consistent variation with clinic size. 

4. The typical open case is rated as showing moderate improvement, while the 
closed case is rated somewhat less on a set of nine scales concerning personality 
adjustment. 

5. The typical patient (both open and closed) is rated as showing slight or 
moderate improvement on the over-all scale of improvement. Patients diagnosed as 
psychoneurotic, psychosomatic, or with involutional or other affective reactions are 
rated significantly more improved than those diagnosed as schizophrenic, paranoid, 
or within the personality disorders. 

6. The percentage of open cases scheduled for treatment once a week or more 
increases with size of clinic. In the small clinic 34% and in the large clinic 48% of 
patients are thus scheduled. A similar increase is seen for closed cases. However, 
no particular diagnostic group is scheduled more or less frequently than other 
diagnostic groups. 

7. Although there is wide variation between clinics, on the average, in about 
54% of open cases and in 34% of closed cases the therapist indicated that treatment 
may have averted hospitalization. There is agreement regardless of clinic size that 
treatment is most effective in averting hospitalization for schizophrenics and least 
effective for psychophysiological cases. 

8. Approximately 92% of open and closed cases are carried by staff personnel; 
the remainder are treated by residents and trainees. The percentage of open cases 
carried by psychiatrists increases with size of clinic. Regardless of discipline about 
79% of the case load consists of schizophrenics and psychoneurotics. However, social 
workers carry proportionately more schizophrenics, while the psychologists carry 
more psychoneurotics. 

9. Rated over-all improvement increases with number of treatment inter- 
views received both for open and closed cases. The correlation ratio is .31 for large 
size clinics and .23 for medium size clinics. 

10. Schizophrenics are rated as showing least improvement, yet in terms of 
averting hospitalization treatment is more effective for this group than any other. 
Psychotherapy thus serves the function of preventing further deterioration and 
hospitalization for this group. 
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COMPARISON OF THREE METHODS OF HANDLING MENTAL 
HYGIENE CLINIC STATISTICS" 
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INTRODUCTION 


During the last ten years there has been a substantial increase in the number of 
mental hygiene clinics throughout the country. However, there has not been a cor- 
responding increase in concern in dealing with the data of mental hygiene clinic re- 
ports. If clinics are to be compared with each other, or if improvement in services 
rendered is to be demonstrated, it is imperative that statistics be collected in a man- 
ner that can be duplicated. The purpose of this paper is to contribute to this area in 
a two-fold manner: First, to compare findings obtained by three different methods of 
collecting clinic statistics, and secondly, to report the experience of one clinic in 
evaluating treatment in terms of success, as well as number of treatment interviews. 

One major reason for the dearth of statistics is the difficulty in reaching agree- 
ment, whether it be in defining the terms used, in categorizing the procedures em- 
ployed, and in classifying the results obtained. For example, when is a case a case? 
At the time of referral? At the completion of the intake processing? Or at the time 
he reports for his first therapy interview? If the last definition is considered, then 
other problems become evident. Certainly the results may differ if a clinic which 
simply assigned each new case in rotation after securing minimal information is 
compared with another clinic which obtained a complete case history, a physical 
examination, four hours of psychological testing, and psychiatric examination before 
accepting the case. 


PROBLEM AND METHOD 


This paper is based on the activities of a VA mental hygiene clinic in which a 
treatment case is defined as a veteran with a service connected disability who has 
gone through our intake process and agreed to enter treatment. The intake process 
involves spending one hour each with a psychiatrist, psychiatric social worker and 
clinical psychologist. The diagnosis and degree of improvement are obtained by 
agreement between the therapist and his consultant. All three disciplines do therapy. 

There are various ways of collecting clinic data. The most convenient method, 
and the one typically used by clinics, is to analyze the summaries of the cases closed 
during a stated period, usually a fiscal year. This method is herein called the “‘closed 
case’? method. A second procedure is to follow-up all cases opened in a stated time 
interval, until all or most are terminated. This procedure is designated as the 
‘follow-up’? method. A third approach is to collect relevant data on all cases in 
treatment on a particular day in the clinic. This procedure is referred to as the 
“survey” method. These are the three methods compared in this paper. But any 
other bases for comparison may be used, breakdowns of cases by age groups, by diag- 
nosis, by symptoms, by experience of the therapist and so on. This study is based on 
three different sets of data: (a) All cases closed during the fiscal year ending July 31, 
1956 (N = 208), closed case method. (b) All cases opened during the fiscal year end- 
ing June 30, 1955 (N = 258). The information about these cases was collected seven 
months after the close of the fiscal year (follow-up method). (c) All cases in treat- 
ment in the clinic on January 15, 1956 (N = 263), survey method. 


RESULTS 
It is not possible, by definition, to have identical population for all three meth- 
ods; however, there is considerable overlap. A comparison of these three methods 
shows some similarities and some differences. The cumulative percent of patients 
by number of interviews for the three methods is presented in Figure 1. The curves 
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for the closed case and follow-up methods are practically identical. About half of the 
cases have terminated treatment before the sixth interview; only 20 percent of the 
cases have had 20 or more interviews. The mean number of interviews for cach 
method is about 15. Although 19 percent of the cases in the follow-up method are 
still open, it is unlikely that the data from the closing summaries would change these 
statistics significantly. 
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Table 1 indicates the results of treatment in terms of improvement. Here, too, 
there is a striking similarity, about 55 percent of the cases are improved (it is not 
possible to get improvement ratings for the survey method since the cases were all 
still in treatment). It would appear that for at least these two aspects of clinic work, 
the follow-up and the closed case method can be used interchangeably. 

The curve for the survey method, the third curve on Figure 1, has the same gen- 
eral shape of the other two, but is much flatter. The differences are pronounced. The 
median number of interviews for the survey method is 10.5 as compared to about 5 
for the other two methods; the mean jumps to 29 as compared to 15; the survey 
method reveals that 36 percent of our active cases have had 20 or more interviews 
whereas the other methods indicated only 20 percent had had that number of inter- 
views. It would appear then, that while there is a considerable number of short term 
cases, there is a gradual build-up of long term cases. Actually, a major part of our 
daily work is spent in treating cases that remain in therapy for 20 or more interviews. 
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Each of the three methods contains some unique features. The method of 
choice, of course, would depend on the data sought. For example, Table 1 presents 
the percentage of patients, by diagnosis, remaining in treatment 7 months after the 
close of the fiscal year (the follow-up method). It is obvious from this table that 
patients carrying the diagnosis of anxiety reaction are terminated very much more 
rapidly than patients with any other diagnosis. Only 9 percent of the anxiety re- 
actions remain in treatment 7 months after the close of the year, for the other diag- 
noses the percent remaining in treatment runs from 19 to 25. If these figures are con- 
sistent from year to year, we would then expect a build-up of our case load in cate- 
gories other than anxiety reactions. Unfortunately, the survey method data were not 
collected in a form to substantiate this hypothesis fully, but do so at least in part. 

The percentage of patients by diagnostic category for each of the three methods 
is presented in Table 2. Several interesting points may be noted. Apparently, a 


TABLE 2. DIAGNOSES FoR Eacu METHOD 
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Method Character Disorders Psychotic Organic 
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Cases Closed (N = 204) 68 5 2: 20 10 
(4 cases not diagnosed) 


Follow-up (N = 258) ‘ 50 y 28 10 
Survey (N = 263) 54 51 : 69 26 




















change has occurred in the diagnoses of the patient population. The follow-up 
method indicates that 40 percent of the cases received during the year were diag- 
nosed as psychotic. This is twice as high as either the closed case method or the sur- 
vey method indicating that perhaps a new trend has been established, in that more 
psychotic patients are coming to the clinic. The increase in the new number of 
psychotic patients may be due to a variety of reasons, such as the effect of the tran- 
quilizing drugs and better liason with nearby hospitals. The other outstanding find- 
ing is that, from the survey method, we find 26 percent of our patients have organic 
diagnoses. This is about two and a half times as many as reported by either of the 
other two methods. From the follow-up method we learn that only 1 in 4 remained 
in treatment seven months after the close of the fiscal year. Hence, we can assume 
that if organic patients stay in treatment, they stay for a very long time. 


SUMMARY 

Three different methods of collecting mental hygiene clinic statistics have been 
presented and some of the results possible from each method have been discussed. 
The closed case method and follow-up method gave almost identical results as to the 
number of improved patients and the number of interviews. While the closed case 
method is easy to use, and is typically the basis of the annual report, the follow-up 
method provides a much clearer picture of the movement of the caseload. The survey 
method demonstrates more clearly what is actually being done in the clinic in terms 
of long-term cases and types of cases currently being handled. For example, these 
data show that the percentage of patients with organic diagnoses actually in treat- 
ment is two and a half times as great as either the number of cases opened or closed 
in a single year, and that the number of patients open at the time of the survey who 
were seen 20 or more times is nearly twice as great as those closed with this many 
interviews. These data could never be gleaned from either of the other two methods. 
It is suggested that these types of data be reported from other clinics for purposes of 
comparison. 
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The placement of a neuropsychiatric patient in the home of a paid caretaker is a 
technique that has been used in Gheel, Belgium since the middle ages, and has found 
a steadily growing use in this country for over 75 years. What is perhaps the strong- 
est endorsement of home care came from Rosanoff “°: °- ) when he wrote: “There 
is no doubt that at least 25 per cent of the population of the average mental hospital 
can be maintained in extramural care with great advantage to all concerned.”’ Classic 
works by Crutcher“? and Pollock “ offer comprehensive orientations to the history 
and technique of home care: articles that are especially pertinent to the present study 
are Cumming’s“? survey of home care in VA hospitals and Hollier and Harrison’s “ 
description of the early phases of the home care program from which the subjects of 
this research were drawn. 

The present study was designed to investigate the usefulness of psychological 
test reports in predicting the course of home care placement of neuropsychiatric 
patients. It was hypothesized that the ability to abide by rules, conformity, is a 
major factor in the success or failure of home care placement, and attempts have 
been made to measure and specifically report this capacity. The immediate question 
dealt with in this study was whether raters could derive enough valid information 
about conformity from test reports to enable them to predict the nature of adjust- 
ment and benefit from placement in home care. Because previous work“? had in- 
dicated that the number of years of NP hospitalization prior to placement was signi- 
ficantly related to the type of use patients made of home care placement, a second 
question needed to be answered: if valid information was gained from psychological 
reports, was this information worth adding to the information already obtained 
from knowledge of the length of previous NP hospitalization? 


METHOD AND PROCEDURE 


The basic data of this study were: (a) judgments of ability to conform based on 
psychological test reports; (b) ratings of patients’ conformity made by sponsors after 
the placement; (c) a score of the difference of per cent of time spent out of NP hospi- 
tals before and after home care placement; and (d) the number of years of NP 
hospitalization before home care placement. 


Test Reports. Of the 191 patients placed in home care between January 1951 and 
December 1954, from the VA hospital, Palo Alto, reports of psychological evaluation 
were available for 112 cases. These 112 test reports were classified as belonging to 
one of the three following samples: 


Sample I (N = 62). These reports fulfilled all the following requirements: (a) referral dealt 
with candidacy for placement in home care; (b) testing had occurred no more than 12 months 
prior to placement; (c) patients had been testable; (d) a standard battery had been used consist- 
ing of the Rorschach, four TAT cards, Porteus Mazes, MMPI, and a sentence completion test; 
and (e) reports had been written in a standard form which forced explicit comments about prob- 
lem areas and defenses. Among the topics specifically discussed in the reports was a section deal- 
ing with conformity. 

Sample II (N = 21). These test reports met all the requirements of group I with the 
following exceptions (a) testing had been completed more than a year before placement or within 
three months after placement or (b) there had been only partial adherence to either the standard 
test battery or the report form. 

Sample III (N = 29). These reports were based on patient contacts no more than a year 
before placement, but neither the standard battery nor report form was used. Four of these 
protocols were of patients who were not testable, and the report was based on interview material 
only. The remainder were routine reports of psychological evaluations. 


‘From the Psychology Research Laboratory, VA Hospital, Palo Alto, Calif. This study was com- 
pleted in association with the hospital’s Social Work Service, H. J. Wax, Chief, and the VA’s Psychia- 
tric Evaluation Project, Richard Jenkins, Director. 
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Judgments of Conformity Based on Test Reports. It had been hypothesized that con- 
formity was a major psychological variable related to success of home care placement. 
For samples I and II, sections of the test reports devoted specifically to conformity 
were utilized in judging the type of conformity. For the reports in sample III, the 
entire protocol was used as the source of material to be rated. ae 

Four types of conformity were arbitrarily defined. None of these descriptions 
was intended to be of good mental health, rather all were pictures of orientation to 
social rules frequently exhibited by the sort of NP patients placed in home care. For 
purposes of this study types C and D were viewed as orientations towards rules 
likely to result in sustained obedience, while types A and B were viewed as orienta- 
tions less likely to result in conformity. The four types of conformity were: 

Type A. Conformity is rare and difficult. To conform, the patient would need much help» 
complete supervision, and tolerance by the sponsor. Capacity to conform is extremely limited- 
Examples from the test reports: “He is inadequate in this respect since he cannot accept nor meet 
demands made upon him. He requires excessive supervision in situations which he finds uncon- 
genial.” “Although the patient seems able to conform to the major requirements of social living, 
his confused, paranoid perceptions may often lead to bizarre distorted actions which possibly 
might require professional, hospital care.” 


Type B. Can conform fairly well except under stress. There is a shaky balance between a 
artially internalized drive to conform and environmental supports on one hand, and deviant 
ehavior on the other. Examples: “The patient has the ability to conform to more formal or 

non-personal situations but will find difficulty in, or will refuse to conform to social situations.” 
“Probably conforms well in routine activities. Difficulties may arise in any situation where his 
adequacy is challenged.’”’ ‘He is able to conform fairly well because he does not usually allow 
himself to become involved.” 


pe C. Needs, but also looks for and utilizes external aids to conformity. Tolerance of 
some iain and strong, clear limits are needed. Examples: “Given a highly structured situa- 
tion, with frequent direction from an authority, this patient would probably conform completely 

. .. Under conditions of greater freedom he would probably resist conformity in a confused and 
rather pointless manner.” “His capacity to conform is superficially adequate for a protected and 
encouraging environment .... He conforms where he can see that this is to his advantage, without 
introjecting environmental standards.” 


Type D. Control is internalized to a great extent; conformity is habitual and frequently 
one of the major characteristics of the patient. The patient accepts much of the responsibility 
to conform. Examples: ‘He feels that in order to exist he must conform to almost everything 
that is requested of him.’’ ‘‘The patient is over-conforming. While this kind of rigid control has 
a number of concomitant disadvantages, it would appear to make him docile and manageable in 
most practical situations.” 


Assignment of Patients to Type of Conformity. On the basis of the test reports, all 112 
patients were assigned to one of the above types of conformity by both a psychologist 
and a social worker. This procedure permitted: (a) the evaluation of the validity of 
the categorizations of both judges as well as the reliability between the two judges; 
(b) the comparison of the reliability with which categorizations of type of conform- 
ity were made from the two types of reports; and, (c) the collection of preliminary 
findings as to whether greater knowledge of the tool used (one psychologist) or great- 
er _—— of the field of application (one social worker) would lead to greater 
validity. 


External Criteria. The relative conformity of patients after they were placed in 
home care was measured by sponsors’ answers to the Post Hospital Adjustment 
Scale (PHAS) devised by Hamister®?, an 80 item check list including a 21 item sub- 
scale of acceptable and unacceptable social behaviors. The corrected odd-even re- 
liability for the PHAS conformity scale has been reported to be .87°. Scores of the 
PHAS conformity seale filled out by sponsors were available for 50 of the patients in 
test report samples I and II. 

The other criterion used, called the per cent gain score, dealt with the benefit 
patients obtained from home care in terms of remaining out of NP hospitals. This 
score was the percentage of time each patient stayed out of all NP hospitals from the 
date of first placement to the cut-off date for this study minus the percentage of 
time the patient stayed out of NP hospitals from the time of his first breakdown to 
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the time of his first home care placement. For the 191 patients from whom the 112 
patients of this study were drawn, Ullmann and Berkman“? found that the median 
percentage of time spent out of NP hospitals was 9014 per cent, while the median per- 
centage of time spent out of NP hospitals from first breakdown to first placement was 
25 per cent. Using a Wilcoxon T test “!: PP. 5-83), the gain in percentage of time out 
of NP hospitals after placement yielded a critical ratio of 8.95. The advantage of 
the per cent gain score is that the patient is his own control. For example, two 
patients may both remain out of hospitals 50 per cent of the time after first place- 
ment. If one patient had been continuously hospitalized since his first breakdown, 
there is a gain of 50 per cent. If the other patient had been hospitalized only 25 per 
cent of the time since his first breakdown, remaining out of hospitals 50 per cent of 
the time after placement represents a relative loss of 25 per cent. It seems reasonable 
to say that the first man gained more from home care than the second, although the 
—- percentage of time spent out of NP hospitals after placement was the same 
or both. 


RESULTS 


Inter-judge Reliability. The psychologist and social worker categorized patients as 
to conformity on the basis of information communicated in psychological test re- 
ports. In terms of likely (C and D) and not likely (A and B) to conform, for the 62 
cases of sample I test reports, the two judges agreed in 44 of the 62 cases. This 
amount of agreement is significant at the .01 level. For group II, the two judges 
agreed as to the likelihood of conformity for 20 of the 21 cases. However, some of the 
reports of sample II had been used as examples for the definitions of types of con- 
formity, and therefore the obtained agreement with this sample might well have 
been inflated. On the routine test reports of sample III where no specific comments 
about conformity were required, the two judges showed chance agreement, placing 
only 16 of the 29 cases in the same way. 


Validity of Judges’ Categorizations. Differences in the mean score of external criteria 
between groups of patients classified likely (C and D) or not likely (A and B) to con- 
form on the basis of test reports were computed separately for both raters. From 
Table 1, it can be seen that patients classified as likely to conform by the psycholo- 
gist and by the social worker had significantly higher PHAS conformity scores based 
on sponsor ratings, than did patients categorized from test reports as not likely to 
conform. 
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LIKELY AND Not LIkE.Ly To CoNFoRM BY A PsycHOLOGIST (Psy.) AND A SoctaL WORKER (8S, W.) 
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From Table 2 it can be seen that patients judged likely to conform on the basis 
of test reports have significantly higher per cent gain scores for samples I and II than 
patients judged not likely to conform. Patients judged by the psychologist and the 
social worker as likely to conform on the basis of the sample III test reports did not 
— significantly in per cent gain scores from the patients judged as not likely to 
conform. 


Effect of Length of Previous Hospitalization. In both the studies of Molholm and 
Barton‘ and Ullmann and Berkman“, the number of years of NP hospitaliza- 
tion before placement (YHBP) were significantly related to the outcome of place- 
ment. The median YHBP for the 191 patients from which the 112 subjects of this 
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study were drawn was 7.0 years. The 112 patients of the three subsamples used in 
this research did not differ significantly as to median YHBP when compared to the 
larger sample of 191 patients from which they were drawn: 51 patients in the test 
report samples had fewer than 7.0 years YHBP while 61 cases had more than 7.0 
years YHBP. However, the 83 patients in test report samples I and II had a median 
YHBP of 10.0 years, while the 29 patients in test report sample III had a median 
YHBP of 2.2 years. 

High YHBP was associated with high percentage gain. Using 7.0 years YHBP 
as the cutting score, 77.7 per cent of the 112 patients were correctly identified as 
having per cent gain scores above or below 50 per cent, a relationship which is sig- 
nificant well beyond the .001 level (critical ratio equals 5.15). In contrast, the social 
worker’s judgments of conformity correctly identified per cent gain scores above or 
below 50 per cent for 64.3 per cent of the 112 patients, a relationship which is signi- 
ficant at the .01 level (critical ratio equals 2.61). The psychologist’s use of the test 
reports led to correct identification of 58.9 percent of the 112 cases as above or below 
a per cent gain of 50 per cent, a relationship which is not statistically significant 
(critical ratio equals 1.59). 


The greater usefulness of YHBP compared to judgments based on test reports 
remained clear when the 83 cases in samples I and II were used without sample III. 
This was a severe test of the relative importance of YHBP because the two samples 
used were the ones in which patients judged as likely to conform by the psychologist 
and social worker had differed significantly in terms of both criteria from patients 
judged not likely to conform. To test whether it was clinically useful to add judg- » 
ments based on test reports to YHBP, fourfold tables were converted to tetrachoric 
correlations? and the multiple correlations between a criterion and YHBP plus test 
judgments were compared with the correlation between a criterion and YHBP alone. 
The tetrachoric correlation between YHBP and per cent gain was .66, while the 
multiple correlation between per cent gain and YHBP plus judgments based on test 
reports was .67 for the addition of both the psychologist’s and social worker’s judg- 
ments. The tetrachoric correlation between YHBP and PHAS conformity scores 
was .65, while the multiple correlation between PHAS conformity and YHBP plus 
test reports was .66 when the psychologist’s judgments were used and .70 when the 
social worker’s judgments were used. 

As the reader might have deduced, the trivial increment yielded by the addition 
of judgments based on test reports, was due to significant relationships between 
YHBP and these judgments as well as the higher relationships between YHBP and 
the criteria. The tetrachoric correlation between YHBP and judgments of conform- 
ity was .42 for the psychologist and .43 for the social worker for the 83 cases of 
samples I and II. 
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Discussion 


The reliability data obtained in this study indicated that test reports which 
forced the discussion of specific aspects of personality within the context of a parti- 
cular environment led to increased reliability of judges comparing patients. While 
evaluating the relationship of each judge’s categorizations to the criteria, it occurred 
to the authors that the validity of the scores of reliability raters could be routinely 
included in research reports. The single additional computation would obviate the 
necessity of assuming equal validity on the part of two raters whose reliability might 
be quite low although statistically significant. 

In this study, the information derived from test reports, while statistically 
significant in relation to the two criteria, added nothing to the relationship obtained 
by the use of YHBP alone. YHBP could conceivably be related to per cent gain by 
statistical artifact, but the relationships between YHBP and PHAS conformity 
scores and between YHBP and judgments of conformity based on test reports argues 
against this, as do previous findings? by Ullmann and Berkman“*) and Molholm 
and Barton.‘ P- 33, Table5) The fact that all of the subjects in this study were VA 
patients placed in home care limits generalizations based on the obtained relation- 
ships, although the very homogeneity of the population would tend to decrease the 
obtained correlations. 

Perhaps the most interesting results obtained in this study were the significant 
positive correlations between years of hospitalization before placement and test and 
sponsor scores of conformity. It may be that these correlations reflect that men who 
were quiet and obedient remained in the hospital long periods of time because they 
did not bring themselves to the staff’s attention. On the other hand, it may be that 
long hospitalization had the effect of teaching the patients to avoid acts that would 
cause censure by the staff. This latter possibility calls to mind the finding of Atkin- 
son”) and Robinson? that schizophrenics learn more readily when punished for in- 
correct responses while normal subjects learn more readily when rewarded for correct 
responses. Long hospitalization may conceivably accentuate this pattern of relating 
to the environment. For the patients in this study, the effect of long hospitalization 
was not deterioration but likelihood of adjustment in home care, a protected and 
limited social situation. The patients who might be thought of as closest to normal 
because they are most responsive to the environment, those patients who conform 
except under stress, had had relatively short previous periods of NP hospitalization 
and did not do well in home care. 

On the basis of considerations such as these as well as previous findings dealing 
with the relationship between YHBP and length of stay in home care, it is possible 
to wonder whether it is more effective for a hospital to try to teach all patients to 
seek positive rewards from the environment or whether it is more realistic to accept 
the partial disability involved in a limited social recovery. Probably this line of spec- 
ulation should be stated as a question: Is there a point at which it becomes thera- 
peutic to stop trying to change a schizophrenic’s pattern of relating to his environ- 
ment and to accept this pattern with a view to establishing a limited extramural 
adjustment? 


SUMMARY 


Using three groups of psychological test reports, a social worker and a psycholo- 
gist judged the type of conformity likely to be displayed by 112 neuropsychiatric 
patients placed in home care. The usefulness of these judgments was evaluated in 
terms of two criteria: (a) the per cent of time subjects remained out of hospitals 
after home care placement less the per cent of time they had remained out of hospi- 
tals from first breakdown to first placement, and (b) the subjects’ conforming be- 
havior in the home as measured by sponsors’ answers to the conformity items of 


2In both studies: (a) remaining in home care was significantly associated with high YHBP; (b) 
living independently was significantly associated with low YHBP; and (c) being returned to the 
hospital was randomly associated with YHBP. 
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Hamister’s Post Hospital Adjustment Scale®’. When test reports were used which 
explicitly discussed conformity, patients categorized as likely to conform showed 
significantly higher scores on both criteria for both judges than patients categorized 
as not likely to conform. When “routine”’ test reports which did not explicitly dis- 
cuss conformity were used, neither of the judges’ categorizations showed group 
differences for either of the two criteria. However, number of years of neuropsychia- 
tric hospitalization before placement was significantly related to conformity ratings 
based on test reports and to the two criteria in such a manner that amount of pre- 
vious hospitalization was (a) a better predictor of the criteria than judgments based 
on test reports, and (b) not improved by the addition of the information based on 
test reports. The findings were discussed in terms of research procedures and the 
possible effect on patients of long hospitalization. 
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PSYCHIATRIC PATIENT AUDIENCE REACTIONS TO 
TYPES OF MOTION PICTURES 


JOHN W. WHITMYRE 
Veterans Administration Hospital, Salt Lake City, Utah 


PROBLEM 


Many kinds of recreation and entertainment are now generally accepted as 
appropriate parts of the overall treatment program for hospitalized psychiatric 
patients. In the Veterans Administration, these activities are for the most part under 
the direction of the Special Service Department and include such things as parties, 
dances, bus rides, and movies shown at the hospital. Very little attempt has been 
made, however, to determine whether these have any appreciable effect upon patient 
improvement. The present investigation is an approach to learning more about the 
value of specific recreational activities, such as motion pictures. 

In the Veterans Administration, the motion pictures shown to psychiatric 
patients as part of their recreation program are selected by a committee in the VA 
Central Office. While this committee rejects pictures which show certain character- 
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istics considered to be undesirable, there has been no experimental evidence available 
to justify such rejection. In addition, it is not known whether it is more desirable to 
show patients light comedies, for example, which could be considered to divert them 
from their problems and anxieties, or dramas, which might “hold the mirror up to 
Nature”’ and conceivably be of more value in the long run. 

The aim of the present study is to determine whether there is any discernable 
relationship between the type of movie shown a psychiatric patient audience and the 
degree of overt disturbance which the audience shows while viewing the picture. 
Also, the validity of rejecting a priori certain pictures for psychiatric patient aud- 
iences is to be evaluated. 


METHOD 


The Theatre Behavior Rating Scale. One study“ in the general area sought to 
show the relationship of the stimulus of one specific motion picture (From Here to 
Eternity) to patient behavior following viewing of the movie. This investigation re- 
vealed many difficulties in attempting to separate the possible effects of the movie 
per se upon patient behavior from the effects of other influences, such as diurnal var- 
iations. In the present study, this problem was minimized by limiting observations 
to patient behavior in the theatre situation while the stimulus movies were being 
shown. 

The first problem, then, was one of determining what behaviors to look for in 
the theatre as the patients viewed the screen, and how to evaluate these behaviors. 
The following procedure was carried out. A list was drawn up of twenty-one relative- 
ly concrete and easily observable behaviors. Each of these, in the author’s opinion, 
would reflect some degree of mental or emotional disturbance if it occurred in the 
social setting of a movie audience. These twenty-one behaviors were presented to a 
group of judges, each judge individually. Six psychiatrists and four psychologists 
comprised the group. The instructions were to place the cards describing the be- 
haviors in three piles according to whether the behaviors would reflect a great degree 
of mental or emotional disturbance, a moderate degree, or a slight degree, were the 
behaviors to occur in a movie audience comprised of NP patients. The judges then 
ranked the behaviors within the piles, combined piles, and thus ranked all behaviors 
from that indicating greatest disturbance to that indicating least disturbance. 

An estimate of the reliability of rankings was made by randomly dividing the 
judges into two subgroups, determining the total rank value assigned each item by 
these subgroups of judges, and correlating these values. This yielded an r of .970 
after correction for halving the number of judges. While this value was exceedingly 
high, it was quite apparent that certain items were being ranked with much less 
agreement than others. Guilford offers a method to distinguish the items whose 
variance is disproportionate in such cases. By use of this procedure, the seven items 
whose standard errors were disproportionately large as compared to the standard 
error of the proportions they represented were eliminated from the study. Thus 
fourteen items of behavior remained, the judgments of which all showed less than 
expected variability. 

These fourteen behaviors were presented to a new group of six psychiatrists and 
four psychologists with the same instructions as above. In this case, when the 
method of dividing the judges randomly into two groups was applied, an r value of 
.972 was obtained. The ranks assigned these items were then converted to scale 
values according to the method of Guilford“) where the lowest ranked item gets a 
zero weight. To make it possible to assign some weight to even the lowest ranked of 
the original items, an additional item was arbitrarily introduced before scaling. This 
item, ‘‘watching the screen attentively”, was assigned the lowest rank, and the rank 
value given to each other item by each judge was raised by one. Thus, the “watching 
the screen attentively” was necessarily given a zero weight. The scale values are pre- 
sented in Table 1, along with the final rounded values used in computation of the 
index of disturbance. 
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TABLE 1. ScaLE AND Frnav VALUES ASSIGNED TO ITEMS UsEp To CoMPUTE 
THE INDEX OF DISTURBANCE 








Items Scale Value Final Value 
Watching screen attentively .00 0 
Whispering to neighbor .90 9 
Leaving seat 1.02 10 
Scratching self 1.07 11 
Sleeping 1.11 11 
Talking aloud to neighbor 1.66 17 
Swinging arms 1.91 19 
Singing to self 1.96 20 
Covering ears 1.97 20 
Talking to self 2.11 21 
Laughing inappropriately 2.18 22 
Clapping inappropriately 2.46 25 
Shouting 2.78 28 
Masturbatin 2.95 30 
Exposing sel 3.20 32 





Observations. All observations was made in the theatre at the Fort Douglas VA 
Hospital, Salt Lake City, Utah. The observers were two graduate students in psy- 
chology who were also VA Clinical Psychology Trainees. Each behavior cited in 
Table 1 was defined as critically as possible in order to aid the observers in making 
reliable judgments. 

Each of two observers was assigned a specific patient subsample to observe. 
The observer seated himself in such a way that he could unobtrusively watch a group 
of ten to twelve patients. Before the movie began, all observers simultaneously 
started their synchronized stop watches. An observation was made each minute 
throughout the duration of the picture. Because of the synchronous timing, ob- 
servations in each of the two sub-groups occurred at the same points in the picture. 
An “instantaneous method” of observation was used. At the beginning of each min- 
ute by the watch, each observer fixed his attention for about 2 seconds on the first 
member of the patient subgroup being observed. Noting mentally if that patient 
showed one of the 14 disturbed behaviors, he moved on to fix his attention to the 
next patient. He continued in this fashion until all the patients were observed. This 
encompassed about 20 seconds each minute. The observer then recorded on a pre- 
pared paper, by code letters, the disturbed behaviors shown by each patient. Each 
patient could contribute only one incident per minute. 

This method eliminated the problem of defining when one incident of a behavior 
ended and another began. There was no possibility of disagreement about whether a 
patient showed arm swinging once or twice in the period, for example. 


Motion Pictures. It was rather difficult to select the most appropriate and clearly 
definable categories of motion pictures to evaluate. By a survey of movies previously 
shown patients under the existing VA selection procedures, it was determined that 
the following kinds of pictures had occurred in a serial order which was random: 
western, adventure, comedy, drama. A few other kinds of pictures occurred, but 
these were disregarded. These, then, were four of the categories of motion pictures 
to be studied. 

In addition, it was learned through the cooperation of the VA Assistant Chief 
Medical Director for Planning that there are several categories of movies banned for 
use in VA Hospitals. Only one of these seemed appropriate to study and clearly de- 
finable: movies depicting war scenes with the central plot and theme centered on 
military conflicts. Most of the other bases for banning were overlapping and rather 
loosely defined. 

The categories of movies studied in this project then met the following defini- 
tions: 
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a. Westerns. Movies concerned with outdoor western American life and dealing with 
essentially a stereotyped plot involving obvious overt conflict between personified forces of good 
and evil. 

b. Adventure. Movies involving physical danger, usually obvious overt conflict between 
personified forces of good and evil, and often involving the overcoming of natural obstacles also. 

c. Comedy. Movies with obvious intent to be primarily humorous. Not to include subtle 
humor as by use of parody, inuendo, ete., but movies where plot, situations, and characterizations 
are frankly and overtly humorous. 


d. Drama. Movies where the plot, situations, and characterizations are rather clearly de- 
signed to show inner conflict, problems of motivation, and other non-humorous, non-externalized 
difficulties. 


e. War Movies. Taken from VA lists: 


In deciding whether a movie scheduled to be shown in the hospital was to be 
included in the study, the motion picture synopsis, as provided in Variety magazine 
and/or VA Central Office was studied and compared by the author with the above 
criteria. If it was adjudged to meet one of the criteria, and the movie of that type 
had not already occurred in the series, the observers were informed and the usual 
procedure carried out. 

Whenever four movies had been chosen and observed in this manner, the Chief, 
Special Service, made arrangements for a movie from category e. to be shown. Thus 
approximately every fifth movie shown (for our purposes) was one of the banned 
films. In all, two series of five movies each were studied, the second being on differ- 
ent patients from the two subsamples and thus constituting a replication. 


Patient Subgroups. Because of practical difficulties, not all patients in the 
hospital could be considered in trying to choose sub-populations to sample. It was 
felt that the most valuable and yet practical basis upon which to make the selection 
would be one which involved the patient’s general awareness of his surroundings 
and alertness to his environment. If, for example, it could be determined that pa- 
tients apparently unresponsive to environmental stimuli show as much overt dis- 
turbance at a war picture as do patients obviously responsive to their environment, 
this might give hints about some approaches to bring environmental reality closer 
to these people. 

Two groups of patients were chosen as hospital sub-populations from which 
the samples to be observed were drawn. These two groups were two hospital wards 
which can be described as follows: Group I. Continued Treatment psychiatric 
patients on an open ward and in good contact with reality and generally responsive 
to environmental stimuli. Group II. Continued Treatment psychiatric patients on a 
closed ward, in poor contact with reality, and generally quite unresponsive to en- 
vironmental stimuli. 

Each judge observed one ward for a series of five movies, one movie of each of 
the five categories: western, drama, adventure, comedy, war. Then the entire pro- 
cedure was repeated on a new series of five movies. Because of the requirements of 
the statistical design, it was necessary that the identical subjects be observed at each 
movie in the series. This led to severe attrition. In the first series, six patients in 
each ward were observed at every one of the pictures, and in the second series, seven 
patients in each ward. The total score a patient achieved through demonstrating the 
various critical behaviors at a given movie was divided by the total minutes he was 
under observation at that movie, yielding an average disturbance score. These 
average disturbance scores were the data treated in the analysis of variance des- 
cribed below. 


RESULTS 
The use of the described observation technique led to quite high interjudge 
agreement. On a preliminary trial the two judges observed the same patient sub- 
sample, recording behaviors just as in the study itself. These average disturbance 
scores for the patients were not normalized as were the average disturbance scores 
used in the study proper. The average disturbance scores for each of ten patients, 
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based on the independent observations of the two judges, were correlated by product- 
moment correlation, yielding an r of .96. The two judges did not differ significantly 
in the mean average disturbance scores they ascribed to this preliminary trial group, 
thus indicating an absence of rater bias. 

In the study itself, the distribution of the average disturbance scores of the 
patients was found to be markedly skewed in every group, mainly because many 
patients had zero scores for some movies, 7.e., showed no incidents of disturbed be- 
havior. Before submitting the scores to analysis of variance, the scores for each 
patient group were normalized for each series of five movies by converting them to 
T-scores with a mean of 50 and a sigma of 10, according to the method of Edwards”. 

The scores were then evaluated by the analysis of variance described by Ed- 
wards“? as an analysis of repeated measurements on several independent groups. 
Two separate analyses were made, one for Series I and one for Series II. The results 
are shown in Table 2 


TABLE 2. ANALYSIS OF VARIANCE FOR SERIES I ANp IIT 











Series I 








Source of Variation Sum of Squares df Mean Square F 
Between wards 912.2 1 912.2 5.92* 
Between S’s in same ward 1541.4 10 154.1 
Between movies 288 .0 4 74.5 1.32 
Interaction, movies and wards 85.5 4 21.4 0.38 
Interaction, pooled S’s and trials 2256 .9 40 56.4 

Series IT 
Between wards 16.0 1 16.0 0.12 
Between S’s in same ward 1638 .2 12 136.5 
Between movies 311.5 4 we 1.08 
Interaction, movies and wards 358.1 4 89.5 1.24 
Interaction, pooled S’s and trials 3470.2 48 72.3 





*Significant at .05 level of confidence. 


There is only one significant F ratio in the table. This is for the first series, 
between patient groups. Observation shows that Group I, the patients in good con- 
tact with reality, responsive to environmental stimuli, and on an open ward, showed 
significantly more disturbed audience behavior than did Group II. In no other case 
do the F ratios approach significance in either Series I or II. This indicates that the 
movies do not differ systematically in their tendency to elicit disturbance in the 
patients. Also, the interaction between movies and patient groups is in neither case 
significant, thus showing no tendency for certain kinds of movies to disturb certain 
kinds of patients. The difference in disturbance between the groups did not recur in 
the second series. 


DISCUSSION 


In each series, a banned movie primarily concerned with war was included, 
These movies did not evoke significantly more disturbance than did any of the other 
films, as measured by the Theatre Behavior Rating Scale. The impression that 
patients were not notably overtly upset by the war films was further substantiated 
by a review of the Nursing Service reports encompassing the night following the 
showing of the banned films, which revealed no increase in patient restlessness in the 
wards in question, insomnia, or in requests for medication. In general, an average 
night was reported. These ‘findings should encourage further investigation of the 
possibility of loosening restrictions of the kind of entertainment allowed mental 
patients. 

The failure of this limited study to reveal a systematic relationship between 
movie type and patient reaction does not provide any additional information about 
which films might be best for patients. It seems to matter very little, for the patients 
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are a remarkably undisturbed audience, by the criteria used, regardless of the kind of 
movie entertainment. If they felt an increase in tension or inner discomfort it was 
not reflected behaviorally. 

The greater disturbance scores among the “‘better’’ patients is not unexpected. 
If these patients are in fact more responsive to their environment they might very 
well be responding more to the films, regardless of the specific content of the movies. 
The trend is not strong however, and did not appear in the replication. 


SUMMARY 


Two groups of hospitalized veteran male NP patients were studied in regard to 
the overt disturbed behavior demonstrated while viewing different kinds of motion 
pictures. The amount of disturbance was determined by observing critical scaled 
disturbed behaviors of the patients in the audience, and calculating an average dis- 
turbance score for each patient at each movie. Two series of five movies each were 
studied. An analysis of variance revealed no significant relationship between movie 
type (western, drama, adventure, comedy, and war) and patient group (relatively 
responsive to environment, on open ward vs. relatively unresponsive to environment, 
on closed ward). 

In the first series only, there was a significant tendency (.05 level of confidence) 
for the more responsive patients to show more disturbance, regardless of the in- 
fluence of type of movie. The results encourage further exploration in the relaxing of 
restrictions on the entertainment and other activities permitted psychiatric patients. 
Apparently some stimuli which would rationally be expected to be disturbing to 
patients are received without incident. 
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AMBIGUITIES IN THE ROLE OF CLINICAL PSYCHOLOGIST 
IN A MENTAL HOSPITAL 


ROBERT SOMMER AND IAN L, W. CLANCY 
The Saskatchewan Hospital, Weyburn, Sask. 


The clinical psychologist is a comparative newcomer to the state mental hos- 
pital. It is only in recent years that any non-medical professional people have entered 
the doors so long occupied by physicians, ‘‘keepers”, and patients. As might be 
expected of any profession in a new setting, the role of clinical psychologist is rather 
ambiguous. A great deal has been written about his duties, especially where they 
overlap with those of other professions, yet very little is known about his place in 
the hospital social structure. Although there is considerable variation in certain 
functions (e.g., psychotherapy, research, training) from setting to setting, the basic 
components of his role are fairly constant. At the risk of appearing outsiders treading 
in another’s province, this paper discusses the role of clinical psychologist in a state 
mental hospital. Previous discussions of “‘the place” of the clinical psychologist 
have over-looked many of the fundamental concepts of social analysis, e.g., role, 
status, function, staff and line organization. Instead of an appraisal along these lines, 
the discussions usually became polemics about lay psychotherapy, medical responsi- 
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bility, or the contributions of a psychologist to a psychiatric team. Although all of 
these questions are interesting and thought-provoking, they are largely irrelevant to 
the role of the psychologist in the total hospital structure. A social-psychological 
analysis of the role of the clinical psychologist may reveal that many of his problems 
are more than manifestations of tradition-bound hospital administrations or auto- 
cratic personalities. It is more likely that these problems are an outgrowth of a 
poorly defined role and of an overlooking of the distinction between line organization 
(the front lines) and staff organization (the supply forces) within the mental hospital. 
For example, one writer“, speaks of testing as the psychologist’s contribution to the 
patient’s care. In terms of the raison d’etre of the hospital, the testing is a staff 
function while patient care is a line function. 


A. Roe 

Role refers to a set of legitimate expectations of behavior, and the cohesiveness 
of any social structure depends on the effectiveness of the performance of important 
roles®), An unusual aspect of the situation is that the posztion (job classification) of 
a psychologist is often established in a mental hospital without any clear idea of what 
his role is to be. The hospital administration desires to be progressive, and the 
hiring of clinical psychologists seems a progressive thing to do.'! Both the administra- 
tion and the clinical psychologist are in the dark as to what his role will be. 

One social scientist “’, has divided the mental hospital into three social systems: 
Clinical, Business, and Patient. One must still go one step further and distinguish 
between staff and line organization in each system. Otherwise one can easily fall into 
the error of assuming that all members of the clinical system are part of the line 
organization and the business system constitutes the staff organization. In any state 
hospital, only certain segments of the clinical system are actually part of the line 
organization. The physician who is in charge of a 150-250 patients may not be a 
part of it, nor the social worker who has the responsibility for 500 patients. The 
“front lines’ are the psychiatric nurses.? The psychologist who reads the results of his 
testing at a conference, is presenting them to the staff organization. There may be a 
considerable lapse of time before they “‘filter down” to the line organization and 
actually influence the treatment of the patients. Hospitals often contain specific 
barriers that impede or prevent such communication. It is almost an unwritten law 
that nursing personnel (especially those who work on the wards) may not have 
access to the ‘‘front office’’ file on the patient where the psychologist’s report is con- 
tained. Ward nursing personnel are sometimes not encouraged to attend the staff 
conferences where such reports are presented. Finally the psychologist’s report is 
usually phrased in such a way that it contains little that is of assistance to the ward 
personnel. The psychologist’s recommendation may pertain to the patient’s need 
for individual psychotherapy, somatic treatment to break a delusional system, or 
transfer to another institution. None of these considerations directly concern the 
ward nursing personnel.’ 

This situation can clarify the problem mentioned by Stainbrook “?, of the nurse 
‘“‘who is in direct relationship with the patient”’ being the most silent participant at 
the staff conference. The reason becomes clear when the distinction is made between 
staff and line organizations of the hospital. This type of conference where each 
member of the “‘team”’ presents his findings is a meeting of the staff organization. In 
fact, the nurse’s role is often stated to be advising the other members of the team as 





1This is also true for other specialties. BELDMAN, D. A critical period for OT in mental hospitals. 
Occup. Therapy, 1957, 30, p. 18, mentions prestige value as a factor in bringing O.T. into mental 
hospitals. 

2As used in this paper, ‘‘psychiatric nurse’ is a generic term covering all those who actually work 

on the wards (psychiatric aides, registered nurses, psychiatric nurses, etc.). 

3Note that this is not the point that the language of the psychologist may be unintelligible to 
other disciplines. The point emphasized here is that psychologist’s recommendations are presented to 
the hospital staff organization, and are not directly applicable to the functions of the members of the 
line organization. 
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to the patient’s behavior on the ward. They are not present at the conference to 
advise her; rather she is present to advise them. The paradox is that she is the one 
who must interact with the patient and receives the least benefit from the staff 
conference. 


B. AutTuHority, RESPONSIBILITY, AND STATUS 


Authority. In few mental hospitals does the clinical psychologist possess any line 
authority. This can be distinguished from his sapiential (advice-giving) authority 
in the staff organization. With the exceptions of his secretary and psychology train- 
ees, he cannot order anyone to do anything. He can request that the ward personnel 
send a patient up for psychotherapy or ask the psychiatrist to delay EST on a patient 
who is part of a research project, but neither of these are cases of formal authority. 


Responsibility. Essentially, the psychologist has very few formal responsibilities. 
He is responsible for testing patients or setting up a therapy group when formally 
requested to do so by the psychiatrist. (If the psychologist initiates this himself, 
then he is no longer responsible to anyone except the Clinical Director.) Further- 
more the psychologist is not responsible for the recommendations contained in his 
test report. His recommendations are given only as advice and it is a common com- 
plaint among clinical psychologists, that, in writing the case report, the psychiatrist 
will include only those portions of the psychological report that he (the psychiatrist) 
agrees with. Regardless of whether the psychiatrist accepts or rejects the psycholo- 
gist’s recommendations, the psychiatrist remains responsible for the patient. This 
is unlike the situation in the medical disciplines where the specialist may give advice 
to the general practitioner. If the GP does not take the advice, then the GP is 
legally responsible for the consequences. If he does accept the advice, the specialist 
becomes legally responsible for the consequences. This is not the position of the 
psychologist, nor would many psychologists care to accept legal responsibility for 
their recommendations. 

However, the psychologist’s recommendations do carry the imperatives of 
sapiential authority and he is responsible to his superior, often the Clinical Director, 
to give good advice. If his advice proves poor, he can be sanctioned, but this also 
requires the support of his superior in having his advice acted on. If his superior 
does not back him up, then the superior is negligent in employing him at all. 


Status. Traditionally in mental hospitals, status has gone hand in hand with author- 
ity. The more authority one had, the greater prestige he enjoyed. The same cannot 
be said of responsibility. The attendant who was in charge of many patients had 
very low status in the hospital hierarchy compared with the carpenter or electrician 
who had very few responsibilities. However, in the case of the psychologist, one finds 
an exception to the concomitance of status and authority. The psychologist enjoys 
relatively high status in the hospital hierarchy although he has no authority. 
Perhaps it would be useful to examine this situation and trace its development. 
Some years ago the physicians in a mental hospital were quite isolated from the rest 
of the staff. They ate in separate dining rooms and lived in separate cottages. There 
were no others on the hospital staff who could be considered their social equals. 
Perhaps some registered nurses and social workers were university educated, but 
there was usually a sex difference and, in the former case, familiarity between physi- 
cian and nurse was frowned upon. Since the 1940’s a new group, the clinical psycho- 
logists, entered the mental hospital. They were for the most part the social equals of 
the physicians, all had university degrees, many were called ‘‘doctor’’, and they were 
brim full of new ideas. Although some of the older physicians felt threatened by 
them, the younger residents and psychiatrists considered them to be kindred 
spirits (especially ideologically where the professional staff was split between those 
with therapeutic and those with custodial orientations). After the initial adjustment 
period was passed, the clinical psychologist was usually given the accoutrements of 
high status; the white coat, the master key, access to the case files, and the title 
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“doctor” (even for the M. A. psychologist). To the line staff (as to the lay public) 
the terms “psychologist” and ‘‘psychiatrist’’ were quite similar and, twenty years 
ago in the mental hospitals, a doctor was a medical doctor. The psychologist also 
attended hallowed staff meetings and was usually observed to be quite familiar 
socially with individual physicians. In essence the psychologist entered the hospital 
as a “‘doctor’’. However, many are still in doubt how to address and interact with a 
clinical psychologist. In essence a considerable portion of the psychologist’s status is 
reflected from the status of the physician. From the standpoint of the line organiza- 
tion, this is status via tradition rather than earned status. Very few in the line 
organization have a clear idea of the functions of a psychologist; the ward nurse per- 
ceives only that the psychologist has a master key, a private office, title “‘doctor’’, 
close association with the ward physician at coffee, and that he visits the ward at his 
convenience to request to see a particular patient. Since the results of the psycholo- 
gist’s testing are not apparent to the ward nurse, the psychologist is unable to earn 
status through merit. It is only through hearsay and perhaps an occasional lecture 
in a training program, that the nurse learns to respect the psychologist for what he 
does, rather than what he represents. 


C. A New Route 


From the proceeding analysis, it would be incorrect to assume that the psycholo- 
gist has no role within the mental hospital. He does have a role but it is tenuous be- 
cause of its isolation from other roles in the system. The clinical psy shologist, in his 
capacity of psychometrician (used in the broad sense of employing a variety of 
diagnostic tests, rather than simple intelligence tests) acts as a consultant to the 
physician or psychiatric team. Unfortunately the physician is often uncertain of 
what to do with the psychologist’s advice. Furthermore the psychologist feels im- 
potent because he has no support from the line organization of the hospital. 

Both these situations are amenable to solution. The psychologist gua psycho- 
metrician,‘ serves in an advisory capacity to the physician (or a team structure of 
some sort). If he does not communicate in terms that are meaningful to the physician 
or team, then the fault is his and it should be corrected. However if the physician 
understands the content of the psychologist’s report but does not know how to use 
the information, then the fault is that of the Clinical Director for improperly de- 
fining the role of the psychologist. If a specialist, whether it be an O. T. or surgeon, 
performs his work satisfactorily but is of little use to the hospital, the specialist is 
hardly culpable. Rather, the Clinical Director, or the person to whom both the 
physician and the psychologist are responsible, is the one who must take the blame. 
Many positions (job classifications) are instituted in mental hospitals because of 
fads for one therapy or another. At one time, there is a need for hydrotherapists and 
educational therapists, at another a need for biochemists and geneticists, or perhaps 
sociologists and anthropologists. 

In many cases the Clinical Psychologist entered the mental hospital without 
anyone having a clear idea of what he was to do. If he were enterprising and skilled 
in administrative protocol, he often created a role in which he felt comfortable. 
Usually this included some testing, research, supervision of psychology interns, and 
perhaps an occasional editing of the hospital paper, organizing of a journal club, and 
selection of films for mental health week. Gradually the psychologists became more 
and more isolated from other hospital departments. Research, training of interns, 
and testing are certainly laudable, but they are not part of the specific hospital line 
functions (the care and treatment of patients and the protection of the community). 
The only other possibility for his contributing to the hospital lay in his communicat- 
ing with the staff organization. Unfortunately, all too often, these personnel did not 
know how to use his services properly. Some members, often young residents, were 
willing to accept diagnosis as an intellectual exercise and considered case conferences 
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as opportunities to test their knowledge of dynamics against the psychologist’s. 
Whoever was the victor was of little consequence to either nurses or patients. 

The problem now is to clarify the psychologists’ role in the hospital structure. 
It would be inappropriate to define it here, but we may attempt to point out some of 
the ways in which he might be integrated into the structure. First, there is the need 
to clarify his role as advisor to other members of the staff organization and to effect 
a satisfactory readjustment of the complementary roles throughout the hospital 
social system to accommodate and integrate this new specialist. The psychologist’s 
test reports should not be ends-in-themselves. In large mental hospitals where the 
diagnosis often does not set the treatment, there may be some question as to the value 
of psychological testing. It is up to the Clinical Director to make this decision. Be- 
cause a psychologist is competent in testing or because the concept of a tripartite 
team is in vogue, are not sufficient reasons to encourage testing in vacuo. The clinical 
psychologist has had several years of intensive and closely supervised training, a large 
portion of which has dealt with psychodynamics and psychopathology. If the Clin- 
ical Director feels that this is most efficiently employed in diagnostic testing, then 
that is his decision. Often he is unaware that the psychologist is able to do consider- 
ably more than this. Perhaps neither he nor any other member of the staff organiza- 
tion can use these other skills of the psychologist, but that does not mean that they 
would not be of use to the members of the line organization. 

This brings us to the second point, the possible benefits from the psychologist’s 
having more direct access to the nursing personnel. Nurses and O. T.’s who are en- 
couraged to form close relationships with patients deserve at least some modicum of 
emotional support to work through their anxieties. The psychologist can be of con- 
siderable assistance as part of the staff organizations for the nursing personnel. He 
can act as advisor, trouble shooter, or as a sympathetic listener when the nurse ex- 
plains her ward problems. Perhaps serious consideration should be given to the idea 
that the nurses might refer patients for testing. At this Hospital, conferences are 
scheduled between the psychologist and the nursing staff to discuss the psychologist’s 
test findings. This required a considerable re-orientation of the psychologist’s 
thinking about the goals of his report. He must place stress on the patients inter- 
personal relations, potentialities, and expectancies, and use terms that are meaning- 
ful to the nursing staff. He must also avoid the routine use of terms such as “‘sui- 
cidal’”’ or “aggressive’’ which set the staff on edge in dealing with a patient. An 
incidental purpose of these conferences is to give the psychologist a good opportunity 
to obtain rapid feedback in his report. He is able to see at first hand whether his 
report is considered useful by the nursing staff. This is a far healthier situation for 
him than the previous one where he sent his report to the psychiatrist and often 
doubted whether the psychiatrist read it or found it helpful. 

The psychologist can be exceedingly useful in the training of the nursing person- 
nel. Often he is exceptionally qualified by his education and experience for such a 
task. He may have been a teaching assistant or taken courses in group dynamics 
and related subjects; and most certainly he is accustomed to preparing reports and 
giving papers. In any of these capacities he can be invaluable in an in-service 
training program. 
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A VALIDATION STUDY OF THE CUSTODIAL MENTAL ILLNESS 
IDEOLOGY SCALE 


ARTHUR H. ROGERS, MARTIN COHEN, AND CLAUDIA S, NARANICK 
Jacksonville (Ill.) State Hospital 


PROBLEM 


The Custodial Mental Illness Ideology Scale (CMI) was introduced by Gilbert 
and Levinson"? in a study of the interrelationship of ideology, personality, and social 
system requirements. One of their interesting findings was a rank order correlation 
of .92 between an index of custodial status and modal ideology on the CMI scale. 
Rankings from most custodial status to most humanistic status were as follows: at- 
tendants, student nurses, nurses, psychiatrists. For individuals (as contrasted with 
the modal individual), a product-moment correlation of .47 was obtained between 
CMI score and the index of custodial status. 

The present study is an evaluation of the validity of the CMI scale in predicting 
success in psychiatric nursing. It is expected that student nurses found to be more 
humanistic prior to their affiliation in psychiatric nursing will be rated as more 
successful at the end of their course than student nurses found to be more custodial 
in their ideology. 


METHOD 


The Custodial Mental Illness Ideology Scale was given to 110 student nurses 
(three consecutive classes of 35, 42, and 33 students each) at the beginning of their 
twelve-week psychiatric nursing affiliation at Jacksonville (lll.) State Hospital. 
Total score was used rather than the mean per item score utilized by Gilbert and 
Levinson, 

Two criteria of success in psychiatric nursing were used. One criterion was the 
theory grade, based on four objective examinations (a total of 309 true-false items 
and 287 multiple-choice items) given during the course. The second criterion was the 
practice grade, based on ratings by clinical instructors in nursing education of the 
student’s work on the wards. For both the theory grade and the practice grade a 
thirteen-point scale was used, extending from a value of zero for F, through 13 for 
A+. 

RESULTS 

Scores on the CMI scale ranged from 40 to 96, with a median of 69.5. Chi square 
values were computed for a 2 x 2 table, with those students who received a theory 
grade of C or below on the one hand, and C+ or above on the other, being compared 
as to whether they had a CMI scale score above or below the CMI median of 69.5. 
For the practice grades, which were higher and more closely clustered than the 
theory grades, one group had a grade of B- or below, and the other group had a grade 
of B or above. For both the theory grade criterion and the practice grade criterion 
the value of chi square failed to reach statistical significance. 

A second analysis of the data was made, using 35 extreme scorers on the CMI 
scale. High CMI scores were designated as those scores of 81 or above (17 Ss) and 
low CMI scores as those scores of 58 or below (18 Ss). Both cutoff points represented 
breaks in the frequency distribution of CMI scores. Table 1 indicates the mean 
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grades of the high and low scorers on the CMI scale in both theory and practice; the 
low CMI scorers (humanistic ideology) received significantly higher grades than the 
high CMI scorers (custodial ideology). 


DISCUSSION 


While the results for the group as a whole failed to support the hypothesis ad- 
vanced, the results of the analysis of grades achieved by high and low scorers on the 
CMI scale lends support to the validity of the CMI scale. 

Our sample is atypical of the general population insofar as 48 of the student 
nurses are Roman Catholic, exceeding the percentage of Catholics in the general 
population. Whether religious affiliation entered into CMI scores (e.g., in an item 
with religious overtones, such as, ‘“‘One of the main causes in mental illness is lack of 
moral strength”) was not determined. 

Another factor that may have influenced CMI scores in this sample was that 
the CMI scale was given before any psychiatric nursing instruction had begun, lead- 
ing some students to respond as neither agreeing nor disagreeing (scored 4 points 
on the seven point scale from strongly agree to strongly disagree) to the item, 
“Once a schizophrenic, always a schizophrenic”, possibly because they may not have 
known what “schizophrenic” meant. 

The last item (“‘In experimenting with new methods of ward treatment, hospi- 
tals must consider, first and foremost, the safety of patients and personnel’) proved 
to be particularly poor in discriminating power. Almost universal strong agreement 
with this item was probably based on the social desirability involved, with a second 
level of inference being required to conclude that some other possible social good 
(e.g., possible patient benefit) might be even more desirable than safety. A related 
point is that 16 of the 20 items are custodial, so that a set to agree with the printed 
word may be involved in a high CMI score. 


Agreement with the fourth item (‘‘Mental illness is an illness like any other’’) 
is scored in the humanistic direction. While persons with a custodial ideology may 
disagree with such an item for one set of reasons, disagreement may also emanate 
from a humanistic basis. Szasz®, for instance, proposed, in a humanistic fashion, 
that ‘‘mental illness’ is not a ‘‘disease’’ that fits the model of organic medicine. 


SUMMARY 


The Custodial Mental Illness Ideology Scale was given to 110 student nurses 
at the beginning of their affiliation in psychiatric nursing. At the end of their course, 
two final grades were available as criteria against which to test the validity of the 
CMI scale in predicting success in psychiatric nursing. One grade was a theory 
grade, based on objective examinations covering the course content. The second 
grade was a practice grade, based on ratings by clinical instructors in nursing educa- 
tion of the student’s work on the wards. 

For the group as a whole, there were no significant differences in the grades 
achieved by those students scoring above and below the median CMI score. For a 
group of 35 extreme scorers (18 low CMI, 17 high CM1), however, significant differ- 
ences were found, with the low CMI scorers having significantly higher grades in 
both theory and practice than the high CMI scorers. The results lend validity to the 
CMI scale as a screening device for predicting success in psychiatric nursing. 
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ADMISSION RATES TO A STATE HOSPITAL: 1920-1956* 
PAUL A. HAUCK AND TRUEMAN HOGUE 
East Moline State Hospital, Illinois 


INTRODUCTION 

Among the myriad factors which have to do with a person becoming mentally 
ill, the temperature changes brought about by the seasons, and all the concomitant 
effects as a result of these weather states have been privately suspected by many 
people as being somehow influential in determining when a mental breakdown, or 
perhaps certain kinds of emotional disturbances occur. A three year period from 1953 
to 1955 was investigated by Hauck"? and a trend was found which was consistent 
for this period at least for those months from September to February. When the ad- 
mission rates from three other hospitals in the state of Illinois were added to the 
rates from the East Moline State Hospital, the pattern for these six months was not 
found to change substantially. The question naturally followed; how typical are 
these three years? The answer was sought by examining the admission data for East 
Moline State Hospital since 1920. 

RESULTS 

Contrary to expectation the data revealed that fewer patients were admitted 
during the severe winter months than during the warmer seasons. The high season of 
the year seemed to be the spring months with a gradual decline until the low point 
was reached in December. As before there was a rise in January and a following de- 
cline in February, after which March, April, May and June were again high rated 
months (Table 1). 


TaBLE 1. AVERAGE MONTHLY ADMISSIONS FOR THE YEARS 1920 To 1956 
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A simple analysis of variance was done to determine if any of the means for the 
twelve months differed significantly from each other. An F ratio of 1.01 is not sig- 
nificant and tells us that the monthly fluctuations noted in Table 1 are due only to 
chance factors. It was felt that a regrouping of the monthly data into the four sea- 
sons might reveal a statistical difference since the subjective impression of the data 
strongly suggested this possibility. Again a simple analysis of variance yielded an 
F ratio that failed to meet the necessary levels of confidence (F = 2.42). 


SuMMARY AND CONCLUSIONS 

What appeared to be a consistent pattern of admission rates to a state hospital 
for the three years from 1953 to 1955 proved upon further investigation to be only a 
temporary state. When admission rates were examined for each month from 1920 to 
1956 it was found by inspection that more patients were admitted during the summer 
months than during the winter months. A test of significance, however, showed that 
none of the months were actually statistically favored with more admissions than 
any other. When the months were combined into seasonal groups, there were again 
no significant statistical differences found. 

It would seem, therefore, that temporary patterns may exist at times which 
make it appear that certain months or seasons have more illness than others, but 
that over many years these monthly and seasonal changes cannot demonstrate that 
they exert this influence, at least for the mental hospital population considered as a 
whole, and for the sexes combined. Further analysis of the data into diagnostic and 
sex categories must yet be made to eliminate entirely the possiblity that seasons have 
no influence on certain kinds of mental illness of either sex. 
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THE PERFORMANCE OF HIGH-GRADE RETARDED, EMOTIONALLY 
DISTURBED CHILDREN ON THE CHILDREN’S MANIFEST ANXIETY 
SCALE AND CHILDREN’S ANXIETY PICTURES 


SIDNEY ROSENBLUM AND ROGER J, CALLAHAN 
Wayne County (Mich.) Training School Eastern Michigan College 


PROBLEM 


In 1956, Castaneda, McCandless and Palermo? published their Children’s 
Manifest Anxiety Scale (CMAS) and since then a series of carefully conceived 
studies by the same authors have appeared detailing the relationships of the scale 
with children’s academic achievement and intellectual functioning“, sociometric 
status“, and performance in complex learning stiuations“: }°’, As with the adult 
form of the test, results to date with the children’s scale have been encouraging and 
highly suggestive for researchers and clinicians. 

In an effort to extend and investigate the usefulness of the CMAS with popula- 
tions other than the normal school children used by the Iowa group, the scale was 
utilized by the writers as part of a larger project?) undertaken to determine if 
tranquillizing medication is an effective therapeutic adjunct in the treatment of a 
population of high-grade retarded children manifesting various behaviors that are 
generally reflective of personal and social maladjustment. The present paper con- 
cerns itself with a presentation of normative data obtained with the CMAS, a report 
of results with another measure of “manifest anxiety’? — the Children’s Anxiety 
Pictures (CAP), and a discussion of some relationships that were obtained between 
the two techniques. 


METHOD 


Measures. Children’s Manifest Anxiety Scale (CMAS). As reported by Castaneda 
et al®) the CMAS consists of 42 anxiety items with 11 additional items called the 
Lie Scale that serve as an index of the child’s proclivity for falsifying responses to 
anxiety items. For use with our population the wording of some of the items was re- 
vised, but in general the scale was identical with the lowa test. Since most of our 
children could not meet the fourth-grade reading level required for self-administra- 
tion of the test, it was presented orally to each subject and responses were marked 
by the child on a specially prepared answer sheet. The index of the level of anxiety 
was obtained by summing the number of “‘yes’’ responses given to the items (e.g., 
“T worry most of the time”’; “I have bad dreams’’). 

Reliabilities averaging about .90 for the anxiety scale after one week have been 
reported for normal fourth, fifth, and sixth grade children“. Using a population of 
63 14-year old boys with a mean Binet IQ of 73.3, a split-half reliability of .72 was 
obtained for the anxiety scale by Keller“? in his study of the relationship of auditory 
memory span to learning ability. 


Children’s Anxiety Pictures (CAP). The CAP was devised by one of the writers 
2) and consists of 40 black and white ambiguous figures of varying shapes. Children 
are asked to tell what each picture looks like to them. Anxiety is defined as the 
tendency to perceive these ambiguous stimuli as representing “‘threatening”’ types of 
objects, such as ghosts, monsters, robbers, etc. The scoring approach is similar to 
Elizur’s® content scoring of the Rorschach. Weightings ranging from zero to three 
are given each response. In an unpublished study by Callahan of 30 11- and 12-year 
old high-grade retarded children, inter-judge reliability of this scoring system was 
found to be .90. Test-retest reliability after an interval of from four to six weeks 
was .85 for 36 13-year old high-grade retarded boys. 


Procedure. The experiment in which the above two measures were utilized lasted for 
a total of 16 weeks. The CMAS was administered to each child by clinical psycho- 
logists three times during the study—within one week prior to the onset of the experi- 
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ment, after a two-month period had elapsed, and post-experimentally.! The CAP 
was administered individually at identical intervals to all children by one of the 
writers(RC). Test procedure was kept constant for each administration. The re- 
sults of the first administration of these measures were used as data for this paper.’ 


Subjects. Thirty school-aged boys and girls chosen by cottage personnel and school 
teachers at the Wayne County Training School, a residential treatment-training 
center for high-grade retarded children with emotional problems, were used as sub- 
jects in the study. They were rated as children who were ‘‘troublesome”’ to an ex- 
treme degree, in that they required, at the time of selection, constant, close personal 
supervision from staff members and tended to disrupt any group in which they were 
participating at any given time. These children were free from neurological signs as 
determined by a thorough medical examination. Table 1 presents identifying in- 
formation about the subjects. 


TaBLeE 1. DescriptivE DATA FOR THE SuBsEcTS TOGETHER WITH MEAN 
ScoRES AND STANDARD DEVIATIONS FOR THE CMAS, L Scatgé anp CAP 
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RESULTS AND DISCUSSION 


CMAS, L Scale, and CAP. In table 1 are presented descriptive statistics for the 
CMAS, L Scale and CAP, based on results obtained from the first administration of 
each measure with our population. A ¢ of 6.72, significant at the 1% level, reveals 
that there is a tendency for our girls to score higher (7.¢., are more anxious) than boys 
on the CMAS. No such differences appear either on the L Scale or CAP, where com- 
puted i’s, .04 and 1.06 respectively, were found insignificant. Although direct com- 
parisons cannot be made between our CMAS results and those of normal children 
because of differences in test administration, it is interesting to note that a similar 
tendency was found with normal children; 7.e., girls gave significantly more ‘“‘Yes’’ 
responses to the items than boys. “? 


TABLE 2. CoRRELATIONS BETWEEN THE CHILDREN’S MANIFEST ANXIETY 
ScaALE, CHILDREN’ s ANxIeTy PicTURES AND L ScaLE FOR R Toran Group 








Measures CAP 


CMAS | 
CAP | 
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L Scale 


— .38* 
+.41* 
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*Significant at 5% level **Significant at 1% level 


‘The writers should like to acknowledge their thanks to Hedwig Minckiewicz, Ernest Smith, 
Janet Smith, and Milton Solomon for their aid in gathering experimental data. 

*Parenthetically, it is of interest to note that subsequent administrations of the CMAS and CAP 
revealed no significant decreases in anxiety (i.e., no improvement) either for subjects receiving tran- 
quillizing medication or for matched controls receiving placebos. 
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Relationships Among Measures. Correlational statistics obtaining among the three 
measures are presented in table 2, using total group data. All the correlations are 
significant at the 5% level or less. 

In their original study, Castaneda et al stated that lack of correlation between 
the L Scale and CMAS would be highly desirable, ‘assuming that the tendency, as 
reflected in the L Scale, for any given subject to falsify his responses to anxiety 
items could result in a high anxiety score as well as a low one.’’®: »- #2) Congruent 
with this assumption, results with a population of normal children produced correla- 
tions between the two measures clustering around zero. With our population, how- 
ever, a significant negative relationship was obtained. (That is, children with low 
anxiety scores tended to falsify their responses.) It is felt that the procedure of 
individual administration of the CMAS was the most important factor responsible 
for this finding. If, as Wirt and Broen suggest, the CMAS actually is testing a 
child’s ‘‘willingness to say deviant things about the self’’ “*. ». 482), then one explana- 
tion of our results is that these emotionally disturbed children chose to use denial 
when required to respond to a series of highly-personal, emotion-tinged statements 
about themselves without the protection or anonymity of a group-administered test 
procedure. No such defensiveness apparently is necessitated by the CAP, and this 
could account for the low but significantly positive correlation between this tech- 
nique and the L Scale. What is suggested by this finding, it seems, is that children 
who tend to falsify their CMAS responses are not doing so on the CAP probably 
because the test, due to its construction, does not provide them the opportunity. 

Related to this point is the significantly negative relationship found between the 
two measures purporting to tap the same variable of ‘‘manifest anxiety’’. Qualitative 
data suggested that taking the CMAS was not a pleasant experience for our child- 
ren. For example, many indicated, upon questioning, that they disliked taking it 
since it made them sad to be asked to reflect on their problem areas. Others seemed 
behaviorally depressed following administration of the test. The CAP, however, 
tended to elicit no such reactions. On the contrary, the task of looking at pictures 
and telling what they thought was represented seemed highly appealing to our 
children, and produced none of the behaviors or comments elicited in response to 
the CMAS. 

Two studies done at the Wayne County Training School have direct bearing on 
the question of the usefulness of these measures. In his investigation of WISC digit 
span performance in high-grade retarded boys, Keller“ did not confirm his hypothe- 
sis that those scoring high on this test would show less anxiety on the CMAS than a 
matched group scoring low. In fact, the trend was in the opposite direction, with the 
Highs appearing more anxious than the Lows, as measured by the CMAS. Since 
these results seemed contrary to clinical theory and previous findings on the pre- 
sumed effect of anxiety on digit span performance® '), Callahan and Keller“? re- 
tested the same groups one year later with the CAP, and found that, consistent with 
expectations, children with low digit span scores tended to perceive significantly 
more threatening objects in the ambiguous stimuli of the CAP (7.e., were more anx- 
ious) than did those with high digit span scores. 


SUMMARY 


The present study reports the performance of 30 boys and girls in residence at 
the Wayne County Training School on two measures purporting to tap “‘manifest 
anxiety”, the Children’s Manifest Anxiety Scale (CMAS), a questionnaire-type test 
adapted from the Taylor Anxiety Scale for adults, and the Children’s Anxiety Pic- 
tures (CAP), a projective-type test consisting of black and white ambiguous figures. 
The subjects in the study were part of a larger project investigating tranquillizing 
medications, were all adjudged emotionally disturbed, and were considered free from 
neurological signs as determined by medical examination. The following tentative 
conclusions seem warranted by the data obtained: 
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(1) There is a tendency for a population of high-grade retarded, emotionally 
disturbed girls to score significantly higher than a comparable group of boys on the 
CMAS. No such differences appear on the CMAS L Scale or the CAP. 


(2) Individual administration of the CMAS, necessitated by the limited reading 
skills of the subjects, tends to produce a defensive set in the children, raising a ques- 
tion as to the validity of the low-anxiety scores thus obtained. 


(3) Children who tend to falsify their CMAS responses apparently are not doing 
so on the CAP. 


(4) There is clinical evidence to suggest that the CAP is a less threatening 
measure for our children than is the CMAS when individually administered. 

(5) Although further research is required, there are suggestive data to indicate 
that for a population of high-grade retarded, emotionally disturbed children the 
CAP is a more useful and practical instrument than is the CMAS with which to in- 
vestigate so-called ‘‘manifest anxiety’’. 
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THE ROLE OF ANXIETY IN PSYCHODIAGNOSIS: 
REPLICATION AND EXTENSION! ? 


LEONARD P. ULLMANN AND WILLIAM A. HUNRICHS?® 


Veterans Administration Hospital, Palo Alto, California 


Current nosological thinking, as exemplified by the American Psychiatric Asso- 
ciation’s Diagnostic and Statistical Manual “®), makes use of terms such as anxiety, 
reality relationships, and concept formation as attributes distinguishing between 
diagnostic groups. Studies by Ash“, Doering“, Mehlman“” and recently Schmidt 
and Fonda“*) have demonstrated that diagnostic categorization may introduce con- 
siderable rater error. However, diagnostic categorization is readily obtainable in- 
formation and offers the research worker a criterion for studies of the variables which 
conceptually differentiate the diagnostic groups. Possible difficulties in this proced- 
ure were brought into focus by Zimet and Brackbill“®. In their study, scores of four 
common and relatively well validated test measures of anxiety were examined for 
97 patients tested at a VA General Medical Hospital whose discharge diagnosis had 
been neurosis, psychosis, or personality disorder. The only differentiation between 
diagnostic groups by the test measures of anxiety was that on the Welch Anxiety 
Index “*) neurotics saw themselves as less anxious than the other two groups“® 
p. 175) 

The importance for research of the finding that the frequently used criteria of 
tests and diagnostic categorization are not consistent led to the present study. This 
investigation (a) repeated Zimet and Brackbill’s study using patients admitted to a 
neuropsychiatric hospital (and therefore presumably more severely ill); (b) gathered 
data of additional test measures of variables of mental functioning used in nosologi- 


cal conceptions; and, (c) compared three test measures of anxiety on groups of 
patients given the same diagnosis at two different hospitals. 


METHOD 


Every male patient entering a VA neuropsychiatric hospital admission ward 
during three months, who was not brain injured or over 50 years of age, was ad- 
ministered the research test battery within two weeks of admission. Eighteen 
months after testing, the established clinical diagnosis was determined from the dis- 
charge summary, or if the patient remained in the hospital, from the progress notes. 
The sample of 90 patients differed from Zimet and Brackbill’s sample in that the 
percentage of psychotics was far greater in the present study, the percentage of 
personality disorders far lower. A second difference between the studies was that 
Zimet and Brackbill used records of patients obtained by referral and might have 
obtained a sample of difficult diagnostic problems. The present sample of all patients 
entering the admission ward during three months may be assumed to have included 
the easy diagnostic decisions as well as the difficult ones. A third difference between 
this investigation and that of Zimet and Brackbill was that in this study all data 
were collected by the same examiner while in the previous study tests were admin- 
istered by a number of different examiners. The present study eliminated the 
possibility that the lack of differentiations between the groups reported by Zimet 
and Brackbill was a result of variation in test administration. 

Three of the four measures of anxiety used by Zimet and Brackbill were re- 
peated in this study. These tests were the Welch Anxiety Index“ and the Psy- 


1Based on a summarized paper read at the Convention of the American Psychological Associa- 
tion, New York City, 1957. 

2From The Psychology Research Laboratory, V. A. Hospital, Palo Alto, California. 

3Now at San Diego State College, San Diego, California. 
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chasthenia scale of the MMPI“? as measures of verbalized anxiety, and Elizur’s 
Rorschach Content Test (RCT) © as a measure of non-verbalized anxiety. 


The Digit Span subtest of the Wechsler-Bellevue“: »- '*® used by Zimet and 
Brackbill as a measure of the interference aspects of anxiety was not included in this 
study. However, the vocabulary scale of the Wechsler-Bellevue was used as a “‘hold’’ 
measure for comparisons with the Shipley-Hartford Reasoning Scale“) and the 
Arithmetic Reasoning Scale of the Army Alpha, Form Six“, These data provided 
information with which to investigate deterioration of mental functions, an area 
which touches on that investigated by the Digit Span subtest. Work such as that of 
Babcock), Becker“, and Shipley“* strongly argues for the hypothesis that 
schizophrenics should do relatively better on a vocabulary test than on a reasoning 
or problem solving task. For purposes of diagnostic work, the average difference, 
vocabulary minus reasoning, should be greatest for the psychotic group. On the 
other hand, if anxiety interferes with complex behavior to a greater extent than it 
interferes with over-learned behavior, and if neurotics are more anxious than psycho- 
tics, then neurotics would be expected to do relatively better on a vocabulary test 
than on a reasoning or problem solving task. For diagnostic purposes, the average 
difference, vocabulary minus reasoning, should then be greatest for the neurotic 
group. 

A frequently used nosological concept is that of appropriate response to external 
reality; e.g., psychotic disorders “‘are characterized by a varying degree of personality 
disintegration and failure to test and evaluate correctly external reality in various 
spheres.”’ “8. P. 4) Three measures based on the MMPI were used as verbal reports 
by the subject of his relation to external reality. These measures were the Barron 
Ego-Strength Scale“, the Gough Intellectual Efficiency Scale ®, and the Block Ego- 
control Scale“. Two scores were included from McReynolds’ Concept Evaluation 
Technique (CET) “°, a method in which the subject is asked to evaluate the ‘‘good- 
ness of fit’’ between 50 suggested concepts and specified portions of the Rorschach 
cards. The two CET measures used in this research were the C score, the number of 
times the patient agrees with the standardization group, and the V score, a measure 
of the variability of the patient’s standards in judging the “goodness of fit’’ of the 
suggested concepts. The first score compares the patient’s judgment with that of 
normals, the second compares the patient’s judgment with a norm he himself has 
set up. 


RESULTS 


Zimet and Brackbill found that only the WAI differentiated the three diagnos- 
tic groups. In the present study, the WAI did not significantly differentiate the 
groups, but the Pt scale and the MMPI (F = 4.98) and the RCT (F = 6.11) yielded 
significant group differences. As can be seen in Table 1, the personality disorders 
were the group with the highest anxiety scores. 

Extending Zimet and Brackbill’s work with additional test measures, no signi- 
ficant differences were found in the level of intellectual functioning of the three 
groups as measured by the raw scores of a vocabulary and two reasoning scales. 
Differences between the standard scores of the vocabulary and reasoning tests did 
not differentiate the groups. Two of the three MMPI scales dealing with reports of 
reality relationships did distinguish between the groups. These results indicated that 
the group of personality disorders had lower Ego-strength and Ego-control than the 
psychotics. Because an MMPI measure of anxiety had differentiated between the 
three diagnostic groups, the correlations between Pt and these two scales were com- 
puted. For the sample of this study, the Pt scores correlated —.83 with the scores of 
the Barron Ego-strength scale, and .71 with scores of the Block Ego-control scale. 
Since correlation between the WAI and Pt was .82 for this sample and .78 for the 
Zimet and Brackbill sample, “* T#>!e 2. P. 176) the scales of control used in this study 
seemed to differ little from measures of verbalized anxiety. Finally, no differentia- 
tion of the groups was obtained from the two CET scores. 
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TABLE 1. ComPaRISoN oF 3 D1aGNostic SAMPLES AT NEUROPSYCHIATRIC AND GENERAL MEDICAL 
Hospitats: Ns, MEANS, SIGMAS, AND t-RaTIos FoR 3 INDICES OF ANXIETY 
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The data presented in Table 1 dealing with a general medical hospital population 
were taken from Zimet and Brackbill“® Table !. p. 1%) while the neuropsychiatric 
hospital sample was collected in the present study. On the three test measures re- 
ported, patients diagnosed psychotic at the medical hospital had somewhat higher 
anxiety scores than patients in the same group at the NP hospital. Patients diag- 
nosed as personality disorders at the medical hospital had somewhat lower scores on 
the tests of anxiety than patients given the same diagnosis at the NP hospital. On 
the two measures of verbalized anxiety, the patients diagnosed neurotic at the medi- 
cal hospital had lower scores than the same group at the NP hospital. The reverse is 
true for the measure of non-verbalized anxiety; the neurotics at the medical hospital 
had higher scores than the same diagnostic group at the NP hospital. 


SUMMARY 


Zimet and Brackbill“® reported disappointing differentiations by test measures 
of anxiety between diagnosed neurotic, psychotic, and personality disorders at a 
general medical hospital. The present study replicated Zimet and Brackbill’s in- 
vestigation using patients at a neuropsychiatric hospital. Additional test measures 
of intellectual functioning were also studied. Comparisons made between patients 
with different diagnoses at the same type of hospital and between patients with the 
same diagnosis at different types of hospital led to the conclusion that there is no 
simple or direct relationship between tests purportedly measuring personality var- 
iables and the use of such concepts as manifested by psychiatric classification. 

While high correlations between a test device and psychiatric categorizations 
may have practical applications, significant statistical agreement does not insure the 
identity of the lines of reasoning that led to the obtained agreement. It is therefore 
argued that diagnostic categorization should not be considered a strong criterion for 
psychological research. 
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THE OVERT-COVERT ANXIETY INDEX AND HOSTILITY 
MOSS L. RAWN 


New York University 


PROBLEM 


In a new test of anxiety, Cattell’s ‘“IPAT Anxiety Scale’, a measure of the re- 
lationship between overt and covert anxiety is presented, ‘‘. . . the value of which is 


to be interpreted in the light of clinical experience with the test .. .””%. ».”. To eval- 
uate the contribution of this overt-covert anxiety index for hypotheses testing, the 
relationship between anxiety and hostility was measured by the Cattell scale and the 
writer’s modification! of Grace’s®: *) method respectively. The theory is advanced 
that covert anxiety is related to diminished expression of physical hostile expression. 
Psychoanalytic theory provides the intervening variable linking anxiety and hostil- 
ity: the control function of the ego. When ego control fails, anxiety becomes overt 
and direct expression of physical hostility a likely accompaniment. 


PROCEDURE 


Twenty-two female college students were group administered the IPAT Anxiety 
Scale and Grace’s Hostility Inventory without prior explanation of hypotheses or 
purposes. Intelligence, age, socio-economic status, occupation, and sex did not 
materially influence the results. Intelligence and age each yielded correlations in- 
significant at the .05 level of confidence for overt-covert ratio and for hostility score.” 
Sex was controlled by arbitrarily eliminating the numerically fewer males. Socio- 
economic background was uniformly lower-middle class and occupation was almost 
exclusively limited to that of nursing. 


RESULTS 


The total group was dichotomized at the overt-covert ratio score mean with 
those in the upper half (high overt relative to covert anxiety) compared to those in 
the lower half with respect to number of direct physical hostile responses simply 

1The “Everyday” situations category was modified slightly with the ‘Professional’ and ‘“‘Inter- 
national”’ situations categories eliminated as superfluous to present design. 

*Vocabulary scores of the Wechsler-Bellevue Adult Intelligence Scale Form I were available and 
utilized as approximate indices of intelligence, a level of refinement adequate for the purposes of the 
study. The correlation coefficients (all insignificant) were as follows: age: overt-covert ratio, r —.08; 
age: hostility score, r .05; vocabulary: overt-covert ratio, r .32; and vocabulary: hostility score, r .33. 
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absent or present.’ It was found that exactly one-half of the total group of twenty- 
two subjects expressed no physical hostile response while the remaining half showed 
one or more responses, a finding which allowed a convenient point of cleavage. Note- 
worthy is the rarity of verbal reporting of a physical hostile propensity, a factor per- 
haps related to social disapproval of aggression in the middle class. This makes the 
appearance of even one such physical hostile response revealing. 

Chi-square analysis of the matrix formed by the two dichotomies (overt :covert 
anxiety and absent: present physical hostile response) yielded a value corrected for 
continuity with one degree of freedom, of 4.90 which is significant at the .05 level of 
confidence. 

SUMMARY 

It is concluded therefore that those least prone to verbal indication of physical 
hostility show most covert anxiety. Similarly, those most given to physical hostile 
expression have greater overt anxiety. The hypotheses concerning the relationship 
between anxiety and hostility is confirmed within the limits imposed by the simplest 
of illustrative analysis. The overt-covert ratio of Cattell shows promise for use in 
more elaborate investigations of psychological theory. 
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’An example of a direct physical hostile response, placed in italics, follows a sample question: 
‘While you are shopping in a store the clerk accidentally spills perfume on you. Which one of the 
following would you feel most like doing: Throwing the bottle at the clerk.@:*) The item answer repre- 
sents the physical hostile alternative of a four response choice task. The three other choices do not 
involve direct physical hostile expression. 


THE INFLUENCE OF INSTRUCTIONAL SET UPON THE 
DISCOMFORT-RELIEF QUOTIENT 


DELL LEBO AND WILLIAM APPLEGATE! 
Richmond Professional Institute 


The Discomfort Relief Quotient (DRQ)® is based on a count of words sug- 
gestive of discomfort or relief. It has been employed frequently as a reliable measure 
of tension during psychotherapeutic processes. An examination of experimental 
work on the DRQ reveals a close correspondence between discussion topic and DRQ 
tension. Early in therapy, for example, when a great number of problems are men- 
tioned, DRQ’s are high. Later, when the patient states fewer problems, lower 
DRQ’s occur, This systematic relationship is apparent under a variety of condi- 
tions ®: ©), It can even be observed in a study reporting a —.05 correlation between 
DRQ’s derived from free verbalizations and clinical ratings of tension®’. Such dis- 
cernment is possible when the procedure of the investigation is examined closely. 
In the first place, tension ratings were not made at the time protocols for the DRQ 
were secured. Further responsibility would seem to lie in the directions themselves. 
‘“‘The patient . . . is asked to tell what is on his mind, or what he is thinking about.” 
(6, p. 658) Jt seems possible that it was the topic of the patient’s conversation that 
determined his DRQ. If his topic was one of relief, his DRQ would be low. Indeed, 
this reasoning is supported by the statement of one of the paranoid subjects with a 
high rating of tension but a low DRQ who spoke of the beauty of lower body parts. 


1Our thanks are due Prof. Edwin 8. Thomas for his help in securing subjects. 
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The relationship between topic and tension has been noted in a variety of ways. 
Gestural movements, for example, are more frequent “? and muscle tension increases 
4) when stressful situations are discussed. Despite the close relationship between 
tension and topic the specific role of the topic under discussion as a determinant of 
DRQ seems to have been largely overlooked. 


PROBLEM 


These considerations suggested both a mode of attack on the problem of tension 
indices and an hypothesis for investigation. The hypothesis was that DRQ is re- 
lated to protocol topic; DRQ’s derived from stories concerned with an unhappy 
event will be highest, those from writings about a happy event lowest and protocols 
produced in regard to a neutral event will yield intermediate DRQ’s. The present 
investigation attempted to produce, by specific instructions to the subjects, a fac- 
simile of DRQ changes as they occur in therapy. 


METHOD 

The subjects used in this investigation were 55 female college students, between 
the ages of 17 to 27 years, taking introductory classes in psychology. Some months 
prior to testing, the most uneventful, or neutral, day of the week was ascertained. 
This day was located by asking the classes (N = 75) from which the subjects were 
drawn to indicate the day of the week regarded as the most exciting and the one held 
as least exciting. Thursday received no nominations. Hence it was regarded as a 
day without positional stimulation. 

On a Friday, subjects were required to write about three topics for 10 minutes 
each. The topics, given singly, were: (a) what they did yesterday, 7.e., the neutral 
day N; (b) the most unhappy event in their life, U; and (c) the happiest event in 
their life, H. 

The subjects were divided into two groups with a different order of presentation. 
The first group consisted of 21 females who wrote on topics NV, U, and H. The second 
group, with 35 females, wrote in a sequence of N, H, and U to avoid any possible 
effects of fatigue on the DRQ of the last written story. H and U were, however, kept 
in juxtaposition to highlight the hypothesized sudden change in DRQ. All of the 
protocols were collected by the junior author. 

Another person coded and scrambled the stories in an attempt to prevent the 
scorer from knowing which stories were written in response to definite instructions. 
The story protocols were scored in accordance with the instructions for determining 
DRQ on the basis of word count. 

The reliability of this measure, as usually reported, is .80 or above. When proto- 
cols of 10 per cent of the subjects, selected on the basis of a table of random numbers, 
were scored by a second judge, a correlation of .88 was found. The judges then con- 
ferred about major consistent scoring differences. Following this discussion, all the 
protocols were rescored. Again, 10 per cent of them were randomly selected and re- 
checked. The rescoring correlation was .99. 


Statistical Analysis. The mean DRQ values were examined by means of t scores 
computed from the standard error of a difference between uncorrelated means. In 
this measure, the standard error is overevaluated usually with the result that the ¢ 
ratio is smaller than would have been the case if the standard error of a difference 
between correlated means had been applied. 

The rank difference coefficient was used to examine the relationship between 
DRQ’s because there were fewer than 30 subjects in one group. A Pearson product- 
moment correlation was used as a companion statistic because of the inevitable large 
number of zero scores in at least one cluster of protocols, e.g., the happy event stories. 
In such cases a ranking, that is, a common rank based on the means of the ranks in- 
volved in such ties, was not truly representative of extremely low DRQ’s. 
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RESULTS 
The means of the different variables and the resultant t’s are shown in Table 1. 
In all cases save that of the comparison of the DRQ means of the neutral and un- 
happy stories of the group, a ¢ significant at the .01 level was found. 
TaBLe 1. Means anp t Scorers AMona DRQ’s ror Neurrat (N), 
Unnappy (U), anp Happy (H) Topics 
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a) 
65 


11 
*Significant at .01 level of confidence. 


Correlations, both rho and r, were also obtained for all combinations of DRQ 
scores. No significant correlations resulted for either group. 





DISCUSSION 

Inspection of the results in Table 1 indicates that the hypothesis has been borne 
out. The mean DRQ was highest for unhappy stories, lowest for happy stories, and 
intermediate for neutral stories. This was true for both groups. All of the differences 
between the means, save one, were significant at the .01 level. 

Considering the present results, the method of obtaining DRQ’s by instructions 
appears to be a promising one. For it can be used to obtain rapidly DRQ’s for com- 
parison with other objective measures of tension or anxiety, such as palmar sweating 
or anxiety scales. In addition to demonstrating the possibility of producing by in- 
structions a reproduction, differing in scale, of some of the changes which take place 
during the therapeutic process, the present experiment implies that the utility of 
DRQ measures may be limited by their tendency to reflect the topic under dis- 
cussion. A counselor or interviewer could influence the DRQ at will by suggesting 
topics for conversation either by asking direct questions, requesting certain informa- 
tion, or nondirectively commenting on particular statements thus encouraging the 
patient to continue in the same vein. 


SUMMARY 

There are many hints and observations in DRQ literature on the relationship . 

between topic and DRQ. This investigation studied the influence of different in- « 

structions on the Discomfort Relief Quotient. Fifty-five young women were told to 

write happy, neutral, and unhappy stories. Significant differences were found be- 
tween DRQ’s of hypothesized magnitudes obtained under these conditions. 
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RELATIONSHIPS BETWEEN THE ADJECTIVE CHECK-LIST, 
PERSONAL PREFERENCE SCHEDULE AND DESIRABILITY FACTORS 
UNDER VARYING DEFENSIVENESS CONDITIONS 


ALFRED B, HEILBRUN, JR. 
State University of Iowa 


PROBLEM 

The Adjective Check-List (ACL) is a relatively new test devised by Gough 
for personality research. It consists of 300 adjectives which can be checked as being 
characteristic or not characteristic of an S’s behavior. Two of the advantages sug- 
gested by Gough for this technique are that by using words familiar to the rater a 
meaningful task is provided and by having a presence-or-absence checking system 
analytic ease is assured. It would seem, however, that these characteristics would 
also tend to enhance the influence of social desirability in item selection, especially if 
used as a self-rating technique. 

The Personal Preference Schedule (PPS), another recent objective personality 
test, was designed by Edwards? to measure fifteen manifest needs earlier described 
by Murray“. In contrast to the above technique, Edwards utilizes a forced-choice 
method which requires Ss in selecting characteristic behaviors to choose between 
pairs of statements roughly matched for judged social desirability. At least one 
study “ has lent support to the notion that this forced-choice procedure is highly 
effective in eliminating social desirability as a source of measurement error. 

Several other studies have emphasized the influence of the social desirability 
factor in various modes of objective personality measurement including true-false 
questionnaires ®: 5. *. %), rating scales, and Q sorts“: ®, It was the purpose of this 
study to evaluate the relationships between judged desirability of need variables and 
the strength of these need variables as inferred from ACL and PPS performance. 
These relationships were studied under experimental conditions intended to heighten 
and minimize, respectively, a defensiveness set. Both personal and social desirability 
factors were considered, since the results of a study by Rosen showed that differ- 
ing response patterns were obtained on the MMPI when Ss were instructed to sort 
cards according to personal desirability of the items and when social desirability 
was the sorting criterion. It was further intended to investigate the ability of the 
ACL to predict relative need levels of Ss as defined by PPS scores. 


PROCEDURE 


A group of 90 undergraduates enrolled in a psychology course at the State Uni- 
versity of lowa were group administered the PPS under standard conditions except 
that the Ss did not record their names on the test, only their assigned experiment 
numbers. One week later these Ss were recalled together and split into two groups 
by sending half the Ss into another testing room. The split was determined by a pre- 
arranged random selection of experiment numbers controlled only for ratio of sexes 
in each group. Once the split had been made, a different experimental condition was 
imposed on each group. 


Defensiveness Condition. The Ss whose numbers had been read aloud were told by E (after intro- 
ducing himself as a psychology staff member) that they had all shown a consistently normal set of 
scores on the PPS and should then be a well-adjusted group according to this test, However, they 
were informed that the PPS was a new, unproved instrument and that forthcoming rformance 
on the ACL would provide a yardstick for evaluating it. Before starting the ACL these Ss were 
required to record their names, addresses, and telephone numbers and told that should any marked 
discrepancy appear between the personality pictures obtained on the PPS and ACL, E would re- 
quest a personal interview to discuss the differences. Further, two advanced graduate students, 
both of whom had served in a teaching role with these Ss, served as proctors for the test. Ss were 
required to take alternate seats. These conditions were intended to produce a test-taking set of 
appearing normal. 

Following administration of the Check-List with instructions to check adjectives character- 
istic of themselves (ACL-real), the Ss were then requested to complete another Check-List marking 
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those adjectives which they would most like-to have as self-characteristic (ACI-ideal). Finally, 
they were asked to rank order the 15 needs scored on the PPS in terms of personal desirability. 
The need descriptions given by Edwards®) were provided, and the Ss were asked to rank their 
preferences for high levels of these needs. 


Frankness Condition. Whereas in the other condition much had been attempted to encourage a 
“look normal’’ kind of defensiveness, in this condition efforts were directed toward making the 
situation non-threatening and encouraging frankness. A serious but youthful appearing graduate 
assistant conducted the group with no proctors. Alternate seats were not required. Anonymity 
was maintained. The Ss were exhorted to cooperate by providing an honest self-appraisal. Finally 
this group was assured that the test they were about to take (ACL) was an experimental one and 
would be evaluated by comparing it with the initial test (PPS). No comment was made regarding 
the outcome of scores on the PPS for this group. Following ACL-real administration, ACL-ideal 
was administered and need preference ratings were obtained as in the other condition. 


ACL Need Scoring Keys. Twenty graduate student judges, all of wham had previous experience 
with psychological tests, were asked to select those ACL adjectives which would characterize a 
person with high levels of each of the 15 PPS needs. Descriptions of each need were provided for 
continual reference. Agreement among at least 9 of the 20 judges was chosen as the criterion for in- 
clusion of an adjective in a need key. This particular level of agreement seemed to best provide an 
adequate number of adjectives in each key without unduly sacrificing communality of selection. 
The number of adjectives included in these 15 keys ranged from 19 (succorance) through 33 (ag- 
gression).' For purposes of this study the assumption was made that the more keyed adjectives 
checked, the higher the particular need level. Though this remains an unverified assumption, it 
seems generally consistent with the procedures used in making clinical inferences regarding per- 
sonality from objective tests. 

Failures in attending the second experimental hour and eliminations based on PPS consistency 
scores of less than 9 reduced the total N to 77. These were divided in the following fashion: de- 
fensiveness condition, N = 38 (27 males; 11 females); frankness condition, N = 39 (25 males; 
14 females). 


RESULTS 


The relationships between personal desirability of needs, the experimental 
conditions, and test performance can be evaluated by inspection of the correlations 


found in Table 1. It can be seen that the personal desirability of needs is highly cor- 


TaBLe 1. Ruo INTERCORRELATIONS BETWEEN NEED PERSONAL DEsIRABILITY Ratinas, ADJECTIVE 
Cueck-List Scores, AND PERSONAL PREFERENCE SCHEDULE NEED Scores UNDER DEFENSIVE 
AND FRANK CONDITIONS 

ACI -real ACL-ideal ACL-real ACL-ideal PPS PPS 
(Defensive (Defensive (Frank (Frank (Defensive (Frank 
Condition) Condition) Condition) Condition) Group) Group) 





Desirability 
Ratings .86** .88** Rr a sae .63* 56" 
PPS .60* .57* .60* .64* 








*indicates p of <.05 
**indicates p of <.01 


related with ACL performance under defensiveness-inducing conditions and moder- 
ately correlated under frankness-inducing conditions. The PPS also showed mod- 
erate and significant correlations with desirability ratings. Under neither condition 
did the switch from real to ideal self-descriptions affect the correlation between 
personal desirability ratings and ACL performance. Significant correlations between 
need scores on PPS and ACL were also obtained. These correlations were not affect- 
ed by the type of self-description on the ACL or by the experimental conditions, 

The relationships between personal desirability, the experimental conditions, 
and test performance were analyzed in another fashion. Gough“? provides two lists 
of adjectives which represent the 75 judged most socially favorable and 75 judged 
least socially favorable of the 300 found on the ACL. Table 2 presents the mean 
number of adjectives from these lists endorsed by Ss on ACL-real and ACL-ideal. 
It can be seen that Ss under the defensive condition endorsed significantly fewer un- 

1These need keys may be obtained without charge from Alfred B. Heilbrun, Jr., Department of 
Psychology, State University of Iowa, Iowa City, Iowa. 
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TaBLE 2. MEAN NUMBER OF FAVORABLE AND UNFAVORABLE ADJECTIVES 
ENDORSED ON ACL-REAL AND ACL-IDEAL BY SS IN FRANK (N = 39) AND 
DEFENSIVE (N = 38) GROUPS 








Defensive Condition Seankk Condition 
ACL-real ACL-ideal ACL-real ACL-ideal 


Favorable 52.24 64.84 51.74 62.26 
Unfavorable 7 .34* 2 47 11.33* 2.67 
*This comparison, using the Medians test because of the skewed distribution, vie elded a X? value 
of 3.55 with 1 df., significant at <.05 level (one-tailed test). No other pertinent between-condition 
difference was significant. 





favorable items as being self-descriptive. (7.e., on ACL-real) than did Ss under the 
frank condition. They did not endorse more favorable items as being characteristic, 
however. There were no significant differences on ACL-ideal. 

A comparison of the relative saturation of the 15 rationally derived ACL need 
keys with these favorable and unfavorable adjectives was made using the ratio, 
number favorable - number unfavorable 
total number keyed adjectives ; 
lative frequency of favorable adjectives on the key in question. The ratio scores for 
the 15 keys were then ranked in order of decreasing magnitude. The rho correlation 
between these ranks and those for mean need scores on ACL-real was .98 for both 
the defensive and frank groups. A mean need score refers to the average percentage 
of adjectives on a particular key endorsed by Ss in a group. 

A final index of performance which was analyzed relative to the experimental 
conditions was the total frequency of response. The mean number of adjectives 
checked (out of a possible 300) for Ss under each condition on ACL-real and ACL- 
ideal is found in Table 3. 


TaBLeE 3. Mean Totat NuMBER OF ADJECTIVES CHECKED BY SS IN 
DEFENSIVE AND FRANK Groups ON ACL-REAL AND ACL-IDEAL 





A high ratio would then indicate a high re- 








Defensive Frank 
Group Group CR p 


ACL-real 108.18 120.28 2.00 05 
ACL-ideal 121.21 119.92 24 — 








Note: The within defensive group difference i is significant beyond the .0001 
level of confidence. 


DISCUSSION 


The results of this study support the contention that a self-rating check-list of 
adjectives elicits performance from college Ss highly related to personal and social 
desirability factors. Such support stems from the rather high rho correlations be- 
tween frequency of adjective endorsement in adjective clusters judged to character- 
ize certain needs and personal desirability ratings of these needs (¢ = .86, .72). 
Even more compelling are the very high correlations between frequency of adjective 
endorsement and a ratio reflecting the relative saturation of the adjective clusters 
with words previously judged to reflect favorably or unfavorably on the endorser. 
These correlations (¢ = .98, .98) reflect a very high positive covariation between 
frequency of endorsement of keyed adjectives and saturation of the keys with favor- 
able adjectives. 

One issue which cannot be critically evaluated with the evidence obtained from 
this study is the amount of error introduced into personality prediction from the 
ACL by this relationship between frequency of endorsement and personal or social 
desirability of the items. It seems reasonable to assume that if test taking behavior 
is this highly related to the desirability factors, even when instructions to the con- 
trary are given, then other non-test behaviors which reflect the needs we are trying 
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to predict would be similarly related (7.e., our manifest behavior is to some extent 
dependent upon social and self approval). However, if ACL test taking behavior 
were more highly related than the behaviors that are to be predicted, the desira- 
bility factor would contribute to predictive error. This appears likely in light of the 
correlations obtained. 

The effects of the experimental defensive and frankness conditions are not 
entirely consistent. While the somewhat higher correlation between personal de- 
sirability ratings and frequency of keyed adjectives checked for the defensive group 
(o = .86) than for the frank group (o = .72) is suggestive of increased defensiveness, 
the equally high correlations of .98 between frequency and the favorable adjective 
saturation of the keys indicates no differences between groups. 

If the total number of adjectives checked is examined as a variable, however, an 
apparent experimentally-induced difference between these groups is found. Under 
the defensive condition, Ss endorsed on the average 108.18 adjectives as self-descrip- 
tive while the frank group averaged 120.28. This difference is significant at a p = 
<.05 level. When asked to check with an ideal-self set, there was no difference in 
group means (defensive = 121.21; frank = 119.92). These results suggest that con- 
striction of responsiveness was one major effect of the defensive conditions when the 
ACL was used as a real-self descriptive instrument. 

Further light was shed on the effects of the experimental conditions and more 
specifically on the nature of the response constriction when the mean number of 
favorable and unfavorable adjectives endorsed by these groups was determined. 
It was found that the defensive group checked significantly fewer unfavorable ad- 
jectives on the average (7.34) than did the frank group (11.33) while making real- 
self descriptions. This difference would seem to be more than a reflection of the gen- 
eral decrease in responsiveness by the defensive group on ACL-real since there was no 
difference between groups on mean number of favorable adjectives checked (de- 
tensive = 52.24; frank = 51.72). Rather it seems reasonable to reverse this and say 
that the fewer unfavorable adjectives checked partially produced the constriction 
observed in the defensive group on ACL-real. The generalization which suggests it- 
self here is that college Ss when attempting to put themselves in a good light (7.e., 
look normal) will tend to say fewer unfavorable but not to say more favorable things 
about themselves. It is interesting to note the consistency of social stereotypes found 
in the ACL ideal-self descriptions. The Ss in this study endorsed on the average 
some 85 per cent of the adjectives independently judged to reflect favorably on the 
endorser while checking only about 3 per cent of those judged unfavorable when 
given an ideal-self set. 

The correlations between mean need scores on ACL-real and those obtained on 
Edwards’ PPS were significant (p = < .05) and unaffected by experimental condi- 
tions (o = .60, .60). Since both ACL-real and PPS showed significant positive covar- 
iation with personal desirability ratings (see Table 1), a first-order partial correla- 
tion between the two was determined with personal desirability held constant. The 
partial correlation of .35 was not significant at the .05 level. The correlations be- 
tween personal desirability ratings and PPS need scores (¢ = .63, .56) suggest a con- 
siderably higher relationship between the desirability factor and PPS performance 
for these college Ss than for the student nurses reported by Navran and Staufacher 
(o = ~-.01)“, Further investigation is necessary to clarify whether these discrepant 
findings reflect differences in personal and social desirability, sampling fluctuation, 
or other factors. 

A more definitive evaluation of the rationally derived need keys for the ACL 
would have best been accomplished by using non-test criteria of need levels, since 
the PPS has not as yet gained such criterion status. However, it seems obvious that 
need measurement by judgmental keys on the ACL still presents many problems 
which must be dealt with .before it could become an effective technique. Among 
these are the improvement of the keys, exploration of possible sex differences, and an 
attempt to control for dissimulation. 
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SUMMARY AND CONCLUSIONS 


This study was concerned with the relationships between the personal and social 
desirability factors, need levels as inferred from rationally derived keys on the Gough 
Adjective Check-List, and need scores on the Edwards Personal Preference Schedule. 
These relationships were examined under experimentally induced conditions intend- 
ed to increase defensiveness and to increase frankness. The Ss were 77 college under- 
graduates enrolled in a psychology course. The major findings were: 


1. Significant and moderately high rho correlations were found between test- 
defined need levels and personal desirability ratings for these needs. 


2. Extremely high positive rho correlations were obtained between relative 

frequency of endorsement of adjectives keyed to specific needs and satura- 
tion of these keys with adjectives independently judged to reflect favorably 
upon the endorser. 
The Ss exposed to the defensiveness-inducing condition made significantly 
fewer checks on the ACL than the experimentally-defined frank group when 
instructed to make a real-self description. There was no difference when 
these groups were given an ideal-self description set. This was interpreted 
as a response constriction related to an induced increase in defensiveness. 


The Ss under the defensive condition endorsed significantly fewer ad- 
jectives independently judged to reflect unfavorably on the endorser than 
did the frank group. There was no difference in the number of favorable 
adjectives endorsed. This supports the notion that under a “look normal” 
defensiveness set, college Ss tend to decrease self-derogation without in- 
creasing self-commendation. 

Need levels as inferred from rationally derived keys on the Adjective Check- 
List were positively and significantly correlated with need levels scored on 
the Personal Preference Schedule. This correlation was not significant when 
the personal desirability factor was partialled out. 


REFERENCES 


1. Epwarps, A. L. Social desirability and Q sorts. J. consult. Psychol., 1955, 19, 462. 
2. Epwarps, A. L. Manual for the Edwards Personal Preference Schedule. New York: Psychological 
Corp., 1954. 
Epwarps, A. L. The relationship between the judged desirability of a trait and the probability 
that the trait will be endorsed. J. appl. Psychol., 1953, 37, 90-93. 
4. Gouacu, H.G. Reference handbook for the Gough Adjective Check-List. Berkeley: The University of 
California Institute of Personality Assessment and Research, 1955 (mimeographed). 
5. Han ey, C. Judged social desirability and probability of endorsement of items on the MMPI 
Se and D scales. Amer. Psychologist, 1955, 10, 404-405 (Abstract). 
6. Kenney, D. T. The influence of social desirability on discrepancy measures between real self 
and ideal self. J. consult. Psychol., 1956, 20, 315-318. 
7. Murray, H. A., et al. Explorations in personality. New York: Oxford Univ. Press, 1938. 
8. Navran, L. and Sraurracuer, J. C. Social desirability as a factor in Edwards’ Personality 
Preference Schedule performance. J. consult. Psychol., 1954, 18, 442. 
9. Rosen, E. Self-appraisal, personal desirability and perceived social desirability of personality 
traits. J. abnorm. soc. Psychol., 1956, 52, 151-158. 











A SURVEY OF PAINFULNESS CONCEPTS 
DONALD V. PETROVICH! 


Veterans Administration Hospital, 
Jefferson Barracks, Missouri 


PROBLEM 


Pain is a vague and individualized concept deserving more experimental in- 
vestigation than has been reflected in the literature. In attempting better to under- 
stand pain reactions, it is helpful to ascertain those things which people consider 
painful, and their evaluation of these painful things. To accomplish this, a pain 
questionnaire was devised and the obtained responses were tabulated, categorized 
and analyzed. 


METHOD 


The ‘Pain Survey’’. Designed to ascertain situations which are considered pain- 
ful®, this survey was purposely kept vague and simple with directions as follows: 
“There are many things which make us feel pain. The purpose of this survey is to 
get some idea of the experiences which people consider painful. List below ten situa- 
tions which you associate with pain.” 

After the painful situations had been listed, the subjects were requested by 
mimeographed directions to indicate whether or not each had occurred in his ex- 
periential background (‘‘actually happened to you’’), and then to rank order each 
as to painfulness (from 1 indicating the greatest degree of painfulness through 10 as 
least painful). Subjects were finally asked to note their age, sex, education and 
occupation. Signing of names was optional. 


Subjects. The Pain Survey was administered by instructors to groups of students 
in beginning psychology classes at Washington University in St. Louis, Missouri. 
Originally given to 150 subjects, including individual administration to friends of the 
writer, the more recent exclusion of protocols of non-students, those not group ad- 
ministered, and those incompletely responded to, reduced the number of subjects to 
100, evenly divided as to sex. Random exclusion of a few acceptable protocols per- 
mitted this desirable number and division of subjects. The present sample was 
made up of 50 males, age range 18 through 33 and mean age of 20.70 years, and 50 
females ranging from 18 through 32 years, with a mean age of 20.16 years. 


Rating Procedures. The 1000 responses (10 per subject) were tabulated and 
placed into response categories as judged by the writer to be relatively distinct from 
one another. These pain categories were established on the basis of specificity of 
physical locus (e.g., headache, tooth drilling, mashed finger), kind of physical injury 
(e.g., broken bone, cut, burn), or kind of situation producing the pain (e.g., auto 
accident, surgical operation, death of a loved one, feelings of rejection). Since these 
categories contained varying degrees of physical orientation, it was decided to have a 
trichotomous classification according to ‘‘painfulness type’’; and to obtain this, four 
clinical psychologists were independently required to classify each of the categories 
according to the following instructions: (a) Physical: painful situations or occur- 
rences in which the physical factors are of primary importance in producing the feel- 
ings of pain. Psychological factors, if present, are relatively unimportant; (b) 


'The writer is indebted to Drs. Rex M. Collier, Henry N. Peters, Charles A. Dailey, Mr. Albert 
E. Wibbenmeyer, and Mr. Edward V. Hackett for helpful criticism and suggestions in devising the 
Pain Survey, and the treatment and presentation of results; and to Drs. Richard J. Sturm, Richard 
G. Murney, Harold T. Ginsparg, and Mr. Gerald M. De lany who were judges for the typing of the 
pain categories. 
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Psychological: painful situations or occurrences in which psychological (emotional) 
factors are of primary importance in producing the feelings of pain. Physical factors, 
if present, are relatively unimportant; and (c) Blend: situations or occurrences which 
are not clearly physical or psychological, but seem to contain components of both. 

Initially, the presence or absence of an identifiable physical stimulus was con- 
sidered as a criterion but this proved inadequate. The ratings by the judges promoted 
an interesting exchange of opinion regarding pain, and discussion revealed agreement 
in regarding all pain as having both physical and psychological components, whether 
or not each can be specifically identified. The relative emphasis on one or the other, 
or both, determined the classification type for each category. Complete agreement 
among the four judges was obtained for 71 per cent of the 200 categories. An addi- 
tional 18 per cent was agreed upon by three out of four judges. The ‘‘psychological”’ 
type showed the best and very high agreement, closely followed by the ‘‘physical’’. 
Disagreement was most evident for the “‘blend” type, which was made up of those 
categories judged as such by three or all four judges, or those for which 75 percent 
agreement was not obtained and therefore were not classified in one of the other 
two types. 


RESULTS 


The distribution of Pain Survey responses according to categories and pain- 
fulness type is shown in Table 1. The 200 categories and 3 painfulness types needed 


TABLE 1. DistRIBuTION oF PaIn Survey RESPONSES: 
BY Type, CATEGORY, AND MEAN PAINFULNESS RANKING 











No. of Responses 
Painfulness Categories | 





Mean 

No. Ranking 
Physical 76 511 5.58 
Psychological 72 227 5.41 
Blend 52 262 5.43 


Total 200 | 1000 | 5.50 


Type 

















to classify the 1000 responses serves to emphasize the range, complexity and sub- 
jective nature of concepts regarding what is painful. It is evident from Table 1 that 
the physical categories were most frequently responded with, but, of particular 
interest is the 22.7 per cent (227 of 1000) psychological responses which reflect con- 
ceptualization of ‘‘painfulness’”’ which does not involve physical stimulation as com- 
monly conceived. 


Frequency and Painfulness. Table 2 presents the pain categories most frequently 
mentioned by the 100 subjects, with the categories listed in order of decreasing pain- 
fulness (mean rank order) for each of the painfulness types. As seen in Table 2, 
burns were the most frequently mentioned pain, having been listed 73 times by the 
100 subjects. Broken bones and headaches were each mentioned 46 times, and cuts 
41 times. Dental painfulness of some kind was noted 72 times by the 100 subjects. 
These are, of course, those pains which are most likely in the experiential back- 
ground of the average individual. The exception is broken bones, for which tabula- 
tion disclosed that only 24 out of 46 (52 per cent) had experienced a fracture, whereas 
93 per cent of the responses indicating burn, headache, cuts, or dental pain were 
checked as having been experienced before. This suggests that some pains do not 
have to be in the hierarchy of personal and direct experience to be brought to mind 
when one is asked to list painful situations or things. 
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TasBLe 2. Most Frequent Pain CatTecorres BY TYPE AND 
MEAN PAINFULNESS RANKING* 














Pain Mean 
Category Freq. Ranking 
Physical 
Gunshot wound 11 3.27 
Bones broken 46 4.20 
Finger mashed, injured 24 4.58 
Burn 73 4.81 
Cramp 15 5.13 
Joints—ache, sprain 28 5.25 
Head, blow to 14 5.29 
Body blow or pressure 17 5.35 
Toe stubbing 11 5.82 
Toothache 23 6.09 
Stomach-ache ll 6.18 
Shock, electric 15 6.33 
Cuts 41 7.05 
Prick, sharp 11 9.00 
Psychological 
Death of loved ones 17 2.29 
Hurt others’ feelings 6 4.83 
Do poor schoolwork 13 5.38 
Disappointment r{ 5.43 
Falling 12 5.50 
Argue with loved ones 9 5.67 
Feelings of rejection 14 5.71 
See pain of others 11 6.36 
Criticism 6 6.83 
Fail to reach goal 10 6.90 
Embarassing situations 10 7.40 
Do disagreeable task 10 9.50 
Blend 
Accident, auto 20 3.50 
Operation, surgical 18 4.56 
Tooth drilling 25 4.92 
Fall—injury 15 5.73 
Headache 46 5.96 
Tooth, work on 14 6.50 
Hypodermic 22 7.41 





*Responses to Pain Survey. Increasing Mean rankings de- 
note decreasing painfulness. 


Painfulness Type. Table 3 results from this tabulation by individuals and shows 
the incidence and mean number of responses given by the sample population, and by 
sexes. Fourfold chi-square contingency tables with correction for continuity“ were 


TaBLe 3. MBEAN NUMBER OF PaINn SurvEY RESPONSES, BY SEX 
(N = 100; 50 Mates, 50 FemMaes) 




















| Total Male | Female 
ee ae aa Ca, 

Type | No. mM | No | M | No | M 
Physical | sua | 5 | ~~ 27 5.44 | 239 4.78 
Psychological | 227, | 2.27 103 | «(2.06 | 124 2.48 
Blend | = | se | 125 | 2.50 | 137 | = (2.74 

Total 1000 =| S«10.00_—Ss=*S! 500 «=| «610.00 500 | 10.00 





used to test for significance of possible sex differences. Using cutoff points approxi- 
mately at the total means as a basis for dichotomous classification of each individual 
as to each painfulness type, the chi-square calculations for physical, for psycholog- 
ical, and for blend disclosed no significant differences between sexes. The comparison 
of males and females as to emphasis on psychological responses most nearly approach- 
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ed significance (P is between .10 and .20 for the two-tailed test) and may indicate 
some tendency for females to stress psychological pain responses more than males do. 


Previous Experience. In responding to the Pain Survey, each subject was re- 
quired to check those of his 10 painful situations ‘“‘which have actually happened to 
you.”’ Tabulation by individuals disclosed that the average male indicated he had 
previously experienced 8.32 of his responses, and the average female 7.72 of hers. A 
fourfold chi-square contingency table classifying each individual as to sex and pre- 
vious experience with their 10 painful situations (cutoff point was less than 8, or 8 or 
more) resulted in a corrected chi-square of 4.20 which was significant at the .05 con- 
fidence level for the two-tailed test. Therefore, for the subject sample, males stressed 
previous experience in listing painful things to a significantly greater extent than did 
the females. 

The categorical tabulations were checked to identify and further examine those 
responses which were not previously experienced by a high percentage of subjects 
giving them. Six categories with at least 11 responses in each, and with less than 53 
per cent of previous experience for each category, were noted. These categories were 
gunshot wound, auto accident, broken bone, surgical operation, falling, and feelings 
of rejection. Frequency and mean rank order for these pain categories can be seen in 
Table 2. A total of 121 responses were given in these six categories and their mean 
rank order was 4.35. For the 52 responses previously experienced, mean rank order 
was 4.55; and the 69 responses not previously experienced had a mean rank order of 
4.20. Since any given subject could and sometimes did respond with more than one 
of these six categories, they are not independent and the degree of interdependence is 
uncertain. Significance tests were not made but these mean rank order values do 
suggest that these six categories were ranked as more painful than was the average 
response of the entire subject sample (mean rank order is 5.50). Also, the situations 
comprising these six categories often imply a significant measure of psychological 
trauma and/or physical injury. As an interesting sidelight, eight of the subjects list- 
ed ‘childbirth’ as one of their 10 painful situations. None of these subjects had ex- 
perienced it but it was given a mean rank order of 3.75 indicating high painfulness, 
and the three males ranked it as more painful (mean = 3.00) than did the five females 
(mean = 4.20). 


SUMMARY AND CONCLUSIONS 

A Pain Survey was devised to assess painfulness concepts by requiring each 
subject to list ‘ten situations which you associate with pain”; and subsequently to 
indicate those ‘‘which have actually happened to you’’, and to rank order all 10 ac- 
cording to painfulness. The survey was administered to groups of university stu- 
dents, with the subject sample consisting of 50 males and 50 females having a mean 
age of 20.43 years. The 1000 responses were placed in 200 pain categories and these 
categories typed as physical, psychological, or blend according to the independent 
judgment of four clinical psychologists. 

1. Painfulness concepts embrace a considerable range and variety of exper- 
iences, including not only specific physical injuries and ailments but, also, situations 
which are primarily or to varying degrees of a psychological (emotional) nature; and 
painfulness concepts also include situations which are not in the personal and direct 
experiential background of the individual. 

2. Nosignificant differences between sexes were found as to emphasis upon the 
physical, psychological, or blend type of response. There may be some tendency for 
females to think more in terms of psychological pain than males do. 

3. In listing 10 painful situations, males stressed things in their personal ex- 
periential background to a significantly greater degree than did females. Things 
listed but not in the personal experiential background tend to be things of greater 
than average painfulness. 
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THE BENDER GESTALT TEST AND LEARNING DISTURBANCES 
IN YOUNG CHILDREN 


ELIZABETH MUNSTERBERG KOPPITZ* 
Children’s Mental Health Center, Columbus, Ohio 


This study, using young school children, was designed to explore the usefulness 
of the Bender Gestalt Test in discovering learning disturbances that are primarily 
due to problems in visual motor perception. An attempt was made to find out how 
well the Bender Gestalt Test can differentiate between children whose achievement 
in reading, writing, and spelling is above average, and those below average. 


METHOD 


Subjects. Two groups of children served as Ss in this study. Group I was used to test 
and refine a scoring system for the Bender Gestalt Test that was sensitive to school 
achievement. Group IJ was used to cross-validate the findings from Group I. 

All Ss were of at least dull normal intelligence. Their age range was from six 
years and four months to ten years and eight months. The mean age for both groups 
was eight years and five months. All Ss were elementary school students in the first 
four grades. Group I included 77 Ss, 41 of whom were selected by their teachers on 
the basis of above average adjustment and achievement, 36 had below average 
achievement. Group II included 51 clinic patients. Twenty of these were referred to 
the Children’s Mental Health Center primarily because of poor school progress and 
learning disturbances; the remaining 31 Ss were referred primarily because of emo- 
tional problems, their school achievement was satisfactory. All Ss in Group I came 
from upper-middle class residential sections of Columbus, Ohio. Group II included 
urban and rural children and their social status ranged from lower to upper middle 
class. 


Procedure. The Bender Gestalt Test was administered to each S individually. The 
classroom teachers gave the test to the Ss in Group I after receiving careful instruc- 
tions and a manual on procedure. In Group II each S was given the Bender Gestalt 
Test by the writer along with a battery of other tests at the time of psychological 
evaluation at the Children’s Mental Health Center. 

After testing numerous scoring items on random Bender protocols of school 
children, 20 categories were selected for use in this study. Most of these were adapted 
from Pascal and Suttell’s“ scoring scheme. Each scoring category was operation- 
ally defined and examples were given. Each category was scored as either present 
or absent. In case of doubt, an item was not scored. The 20 scoring categories were 
as follows: 


A. Distortion of Shape (all Figures) K. Shape of Circle (Figure 2) 

B. Rotation (all Figures) L. Deviation in Slant (Figure 2) 

C. Workover or Second Attempt (all M. Dashes or Dots for Circles (Figure 2) 
Figures) N. Blunting (Figure 3) 

D. Part Missing (all Figures) O. Number of Dots Incorrect (Figure 3) 

E. Confused Order (total protocol) P ~—. a Figure not Joined (Figures A 

. poem Pa Se i la anc 

F. Overlapping of Figures (all Figures) Q. Three or more Angles in Curve (Figure 

G. Compression of Figures (total protocol) 6) 

H. Substitution of Circles or Dashes for R. Angles, Missing or Extra (Figures 7 
Dots (Figures 1, 3, and 5) and 8) 

[. Perseveration (Figures 1, 2, and 6) S. Guidelines (Figures 2, 3, and 5) 


J. Wavy Line (Figures 1 and 2) T. Enclosure of Figure (all Figures) 


A reliability check with another psychologist yielded 93% agreement on the 
independent scoring of 14 Bender protocols. The writer then scored all protocols 
“blindly”. Thereafter, they were grouped according to grade placement and school 
achievement of the Ss. The presence or absence of deviations on the Bender protocols 


*The writer wishes to express her appreciation to Mrs. Jenny Raduege, Miss Judy Ward, and 
David D Blyth for their valuable assistance. 
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was next analyzed and compared by means of chi-square tests. Each scoring cate- 
gory was tested separately to see whether it differentiated significantly between the 
above average and below average students. This was done for the first two grades, 
for the third and fourth grades, and for all four grades combined. Only those scoring 
categories were considered as differentiating markedly between the good and poor 
students in which at least two of the three P values obtained were significant at the 
five per cent level or better. Seven of the 20 categories met this requirement. These 
were: Distortion of Shape, Rotation, Substitution of Circles and Dashes for Dots, 
Perseveration, Failure to Integrate Parts into Wholes, More than three Angles in 
Curves, and Extra or Missing Angles in Hexagon. 

When each Bender Figure was analyzed separately in the category Distortion of 
Shape, it was found that distortions (as defined in this category) occurred rarely in 
Figures 1, 2, 4, 6, and 8. When they did appear it was equally often among good and 
poor students. On the other hand, distortions in Figures A, 3, 5, and 7 differentiated 
well between the two groups of students. Thus only distortions in Figures A, 3, 5, 
pu 7 were included in the final scoring system. 

A comparison of those children using only one sheet of paper for the Bender 
with those using two or three pieces revealed no significant difference in regard to 
school achievement or Bender Test performance. 


FINAL ScorinG SysTeM 


The original 20 scoring categories were reduced to a final scoring system which 
only included the seven significant categories. A Composite Score was then com- 
puted for each S by adding all his scoring points. The highest Composite Score a S 
could get was 31. The final scoring system was once more tested on Group I. After 
computing the Mean Composite Scores for the first two grades, the third and fourth 
grades and all four grades, chi-squares were used to compare the good and poor 
students whose Bender Composite Scores were above and below the Group’s Mean 
Composite Score. Table 1 shows significant results indicating that good students 


TaB_e 1. A CoMPARISON OF BENDER COMPOSITE SCORES FOR ABOVE AND BELOW AVERAGE STUDENTS 
oF Groups I anp II witn X? anp P VaLuEs 




















Mean Good Students | Poor Students 
Grades Composite Mean Mean 
Score Above Below Above Below X? to 
Group I 
1&2 5.6 5 17 15 6 10.2 > .001 
3&4 4.1 4 15 9 6 5.4 < .02 
all 4.9 11 30 26 10 15.9 < .001 
Group II 
1&2 8.7 1 10 2 2 14.5 <.001 
3&4 5.7 0 9 12 5 11.8 < .001 
all 7.2 2 18 21 10 16.4 < .001 











tend to have a low Composite Score, that is, few deviations on the Bender, whereas, 
poor students will tend to show a high Composite Score or many deviations. 


Cross-validation. A cross-validation of the final scoring system was carried out on 
Group II. The test protocols were analyzed and Composite Scores were computed 
for each S in Group II. As would be expected, the Mean Composite Scores for Group 
II were somewhat higher than for Group I. It was felt that these higher scores are 
probably closer to Mean Composite Scores for elementary school children generally. 
Thereafter chi-squares were computed to determine the validity of the scoring system. 
The results in Table 1 indicate with a high degree of confidence that the Bender 
Gestalt Test can differentiate between groups of children with learning problems 
and with satisfactory school achievement. 





ELIZABETH MUNSTERBERG KOPPITZ 


BENDER GeEsTALT Test AND IQ 

It is well known that bright children tend to have higher school achievement 
than dull children. The Bender Gestalt Test, a test of visual motor perception, was 
found to be related to school achievement also. An attempt was made to explore the 
relationship of school achievement to IQ and the Bender Test. 

All Ss in Group. II had been given the WISC or the Stanford-Binet Scale at the 
same time the Bender was administered. The distribution of above average and 
below average IQ and high or low achievement was compared by means of a chi- 
square. Table 2 shows that a significant relationship exists between IQ and school 
achievement. But a comparison of Tables 1 and 2 shows that the Bender is even 
more closely related to achievement than is the 1Q. 

For the first two grades visual motor perception and IQ are both significantly 
related to school achievement and may overlap to a considerable degree. With one 


TaBLe 2. A Comparison OF IQ’s ror ABOVE AND BrLow AVERAGE StrupENTs oF Group II witH 
X? anp P VALUES 








i] 
| 


| Good Students Poor Students 
Grades IQ 99 IQ 100 IQ 99 IQ 100 
| below above | below above X? P 
1&2 | , 9 | 11 3 9.0 <.01 
3&4 ¢ 6 } 13 4 4.6 < 01 < .02 
all : 15 24 7 13.6 <.001 











exception all the good students had good Benders with few deviations (the one ex- 
ception was a Bender of average quality), and all had an IQ of above 100 with the 
exception of two Ss whose IQ’s were in the high 90’s. All the poor students had poor 
Benders and low 1Q’s with the exception of two very immature, slow moving, non- 
verbal children. Two bright Ss with severe visual motor difficulties were, also, 
among the poor students. 

On the third and fourth grade level there was found to be a somewhat greater 
discrepancy between school achievement and IQ than among the younger Ss. All 
good students were found to have good Benders—- even the three Ss whose 1Q’s were 
below 100. Among the poor students with below average IQ, 10 had poor Benders, 
while three had fairly adequate Benders but revealed severe emotional problems in 
addition to difficulties in auditory perception. Speech problems, poor auditory per- 
ception and extremely low frustration tolerance were present in the four children 
who did poorly in school despite above average IQ’s. Two of these had quite ade- 
quate Benders. 


DIscuUsSsION 


Very few children under the age of nine years can reproduce the Bender Figures 
perfectly. However, not all deviations on the test are related to learning disturb- 
ances, nor is it felt that they all indicate serious problems in visual motor perception. 
Of the 20 deviations tested in this study only seven proved to be related to school 
achievement. The other deviations appeared to be primarily related to age or to 
emotional factors. 

Pascal and Suttell’s“) table, comparing Bender deviations of psychotic patients, 
young children and normal adults, was used to determine which deviation could be 
considered to be primarily of psychogenic origin and which were chiefly indications 
of immaturity. 

Among young school children there was found to be little relationship between 
school achievement and the ability to arrange the Figures neatly on a sheet of paper 
or to draw a straight line, the omission of parts of a Figure, and the blunting of the 
arrow head and the addition of dots on Figure 3. These phenomena were found equal- 
ly often among good and poor students six years of age; they decreased as the Ss 
grew older. 
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There are other kinds of distortion on the Bender that are not related to school 
achievement including the tendency to overlap Figures, to draw careless and dis- 
torted circles, and the failure to maintain the slant on Figure 2. Both immaturity 
and emotional upset may produce these deviations. They were found among the 
very young and among psychotics. 

Several deviations on the Bender seem to be primarily related to tension and 
anxiety since they were found frequently among Pascal and Suttell’s psychotic 
patients but not among the very young. These deviations include the tendency to 
rework Figures, to make several attempts before completing a design, to constrict 
all 9 Figures into less than half of the sheet of paper or to string them along the edge 
of the paper, and finally to substitute tense little lines or dots for circles in Figure 2. 
It seems significant that the good students in this study, who had been considered 
well adjusted by their teachers, showed more of these tension indicators than the 
poor students. The difference was, however, not statistically significant. 

In contrast to the deviations discussed so far there were two groups of deviations 
that were definitely related to learning disturbances. The first of these can be des- 
cribed by the inability to control lines both in directionality and in shape. The follow- 
ing deviations are included in this group: difficulty in drawing angles in Figures A, 
7, and 8; the tendency to rotate Figures; and the inability to draw sinusoidal curves 
on Figure 6. Since all of these deviations are found among young children as well as 
among psychotic patients, according to Pascal and Suttell, it is hypothesized that 
we are dealing here with phenomena of either immaturity and/or a loss of control 
due to confusion or regression. 

The second group of deviations related to learning problems can be described 
by the inability to integrate parts into wholes and the inability to control and ter- 
minate visual motor activity. The following deviations are included in this group: 
distortions on Figures A, 3, 5, and 7; the substitution of dashes or circles for dots on 


Figures 1, 3, and 5; perseveration on Figures 1, 2, and 6; and the failure to integrate 
parts on Figures A, 4, and 7 into meaningful wholes. All these deviations occurred 
only among Pascal and Suttell’s very young children but not to any great extent 
among their psychotic patients. It is, therefore, hypothesized that these phenomena 
indicate immaturity in younger children, while they suggest retardation and possible 
brain damage in older children. 


SUMMARY 


A scoring system for the Bender Gestalt Test sensitive to learning problems in 
young school children was developed on 77 first to fourth graders. Items from Pascal 
and Suttell’s scheme were adapted for this study. Of the 20 scoring categories tested, 
only seven were found to differentiate significantly between good and poor students 
including: Distortion of shape, rotation, substitution of circles and dashes for dots, 
perseveration, failure to integrate parts into wholes, three or more angles in sinusoid- 
al curves, and extra or missing angles on hexagons. For each S a Composite Score was 
computed by adding the number of significant deviations in his protocol. The scoring 
system was cross-validated on a group of 51 school children. The results indicate that 
the Bender Gestalt Test can differentiate significantly between above average and 
below average students in the first four grades of school. 
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JUDGING INTELLECTUAL STATUS FROM THE 
BENDER-GESTALT TEST 


ROLAND M. PEEK AND LOWELL H. STORMS! 
Hastings (Minn.) State Hospital 


There have been few studies investigating the relationship of intelligence to 
performance on the Bender-Gestalt test. Some clinicians believe B-G performance to 
be related to intelligence (for example, Pascal and Suttell®, though they find no 
correlation with their B-G score); objective studies have considered primarily only 
the B-G Recall“: ® ®, a single B-G index, or extremely limited intellectual ranges 
‘), The most adequately controlled study“ suggested that there is apparently a 
small but consistent relationship between number of figures recalled and intelligence, 
with a slightly stronger correlation between number of figures recalled and the level 
of intellectual functioning. No controlled examination of the total protocols or of 
more than one B-G variable has yet been reported. 

This study tests the primary hypothesis that (a) intellectual capacity and (b) 
intellectual efficiency can be subjectively estimated from the B-G, using the Shipley- 
Hartford Scale (S-H)“ as a criterion. The inter-judge reliability of such estimates 
is also investigated, as well as the relative effectiveness of judges with varying de- 
grees of clinical experience with the B-G. It should be pointed out that this is not an 
attempt to test the relationship of specific B-G behaviors to intelligence; it is at this 
stage simply a controlled investigation of a clinical situation wherein the psycho- 
logist attempts to evaluate subjectively the subject’s intellectual status from his B-G 
protocol, using whatever method of analysis he chooses. 


SUBJECTS 


Subjects for the study were randomly and independently selected (the first 100 
psychiatric non-neurological in-patients listed in the alphabetical case files and meet- 
ing the criteria) by four students.? Regardless of sex, those subjects were included 
who were between the ages of 16 and 59, inclusive, and who had been given both the 
B-G and S-H within seven days of each other, both of which tests had been admin- 
istered previous to the two months preceding the judging (thus decreasing the likeli- 
hood of a judge recognizing the test protocol). To be included, the subjects must have 
achieved a S-H vocabulary raw score of 16 words or more with at least one abstrac- 
tions item attempted, regardless of its correctness. On the B-G both Original and 
Recall“ must have been attempted, regardless of the number or accuracy of figures 
reproduced. In cases where several sets of these tests existed, only the first set was 
used (to eliminate practice effect). 

The resulting 100 subjects had a modal age range of 30-39 and were by chance 
evenly divided by sex. Seventy-nine were diagnosed as psychotic (49 schizophrenic 
reactions, 20 psychotic depressive reactions, 10 psychoses of other types or un- 


TaBLeE 1. INTELLECTUAL LEVEL oF SuBsEcTs (N = 100) BasEp on S-H 
VOCABULARY OR ABSTRACTIONS Score (WHICHEVER HIGHER) 








Shipley-Hartford | 





Intellectual level Vocabulary | Abstractions | 





|— 
Superior or above | 33-40 | 


Low Average 17-20 


High average 29-32 30-3: | 
Mentally Deficient 16-below | 14-below | 


Average | 21-28 





‘Now at Department of Psychiatry, University of California Medical Center, Los Angeles 24, 
California. 

*The authors are greatly indebted to the following students for their assistance on this project: 
Mrs. Julene Nelson, Miss Janice Peterson, Miss Ruth Swanberg, and Miss Katherine Swanson. 
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TABLE 2. INTELLECTUAL IMPAIRMENT OF SUBJECTS 
(N = 100) , 


Impairment S-H CQ N 
None 95-above 27 
Very mild 90-94 9 
Mild 80-89 20 
Moderate 70-79 18 
Severe 60-69 21 
Very severe 59-below 5 











specified), with 11 neuroses, 6 personality disorders, one mental deficiency (primary 
diagnosis) and one undiagnosed. Distributions of intellectual level and degree of im- 
pairment, as measured by the criterion instrument, are indicated in Tables 1 and 2. 
Parenthetically it should be noted that the negatively skewed distribution of in- 
tellectual level presented a conflict for the judges, who reported that they frequently 
felt they were assigning too many cases to the higher intellectual levels. Whether 
this influenced them to avoid judging as many at the higher levels later, thus reduc- 
ing their accuracy, is of course unknown. 


PROCEDURE 

All identifying information, including date of administration, was obscured by 
the assisting students so that each record contained only the subject’s age and sex 
in addition to his test reproductions. After randomizing the B-G’s, the students 
presented the protocols to three judges* with varying lengths of clinical experience 
with the test: Judge “‘A”’ for 11 years, Judge ‘‘B”’ for three years, and Judge ‘‘C”’ for 
two years. Each judge classified each B-G into one of the following categories of in- 
tellectual level: superior, above average, average, below average, mentally defective. 
Each judge also estimated the extent of impairment, using one of the following 
categories: none, very mild, mild, moderate, severe, or very severe. Both sets of 
categories are defined in terms of S-H scores as shown in Tables 1 and 2. 

The accuracy of each set of judgments was assessed, as well as inter-judge 
agreement. Some effort was also made to estimate the seriousness of judging errors 
and their relationship to length of experience with the test. All tabulations were per- 
formed independently by the students. 


RESULTS 
In order to provide adequate frequencies in the cells for the computation of chi 
squares in testing validity of judgments, the superior and high average categories 
were combined, as were the low average, borderline, and mental defective categories. 
For the same computation, the impairment categories of “‘none’”’ and ‘‘very mild” 
were combined, as were “mild’’ and “moderate”, as well as “‘severe’’ and “very 
severe’. 


TABLE 3. RELIABILITY OF JUDGMENTS OF INTELLECTUAL LEVEL AND IMPAIRMENT 
FROM B-G Protocots (N = 100) 








| Per cent Significance 


Agreement Level 


| 
Intellectual | Pairof | Contingency | Significance 


functioning Judges | Coefficient | Level 





| 
AB | __ .38* p<.o1 | 64 | p<.001 
Intellectual level A-C .54 p<.001 | 69 p< .001 
B-C .29 p> .05 53 p<.001 
} 
| 


A-B 47 «| p<.00l 
Intellectual Impairment A-C .52 p<.001 
B-C 46 p<.001 





54 p< .001 
58 p<.0l 
47 p<.0l 

















*With the contribution of one cell omitted due to too low (one) an expected frequency. 


*Acting as a third judge was Howard J. Bennett, also of Hastings (Minn.) State Hospital staff. 
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As can be seen from Table 3, pairs of judges rating the same protocols agreed 
from 53 to 69 percent of the time (p < .001) when estimating intellectual level from 
the B-G, and from 47 to 58 percent of the time (p < .01 to < .001) when estimating 
impairment. Contingency coefficients give a better estimate of reliability, however, 
and those in Table 3, ranging from .29 to .54, are rather unimpressive; the lowest is 
not even significant at the five percent level. For the most part, it can nevertheless 
= said that pairs of judges agree on these ratings significantly more often than 
chance. 


TABLE 4. VALIDITY OF JUDGMENTS OF INTELLECTUAL LEVEL AND IMPAIRMENT FROM B-G ProtocoLs 
(N 00) 








Intellectual 


| | | 7 
Judge | Chi Square | Significance Per cent Significance 
functioning 


| 
| 
Level Agreement Level 
| | with Criterion | 





p> .05 ~ 


53 p<.00l 
p> .50 - 


46 p<.001 
* 52 p<.001 


Intellectual level | 

p<. | 40 p> .05 
| 
| 





p<. 50 p< .001 
p< .0: 35 p> .05 





| 
| | p> .20 
| 
| 
| 


| 
Intellectual Impairment | 
| 





The same cannot be said for the accuracy of the ratings (see Table 4). In only 
one case (Judge ‘‘B’s’’ estimates of impairment) are both the chi square and the per- 
centage accuracy significantly higher than chance expectancy under the null hypothe- 
sis, although the percentage of correct estimates of intellectual level was significant 
for all judges, as were the chi square values for estimates of impairment. 

To perform a simple check on the seriousness of judging errors, a tally was made 
of instances where (a) the subject was below average but rated by any judge as being 
above average, or (b) the subject was above average but rated as below average. No 
errors of the first type were made; all errors were of the second type, 7.e., were under- 
estimates. One case (a paranoid schizophrenic with very severe impairment) was 
misjudged by all three judges; nine others were misjudged by only one judge. These 
were all cases involving schizophrenics except for one personality disorder and one 
psychoneurotic, which is roughly proportionate to the diagnostic distribution in the 
sample as a whole; all had varying degrees of impairment. 

The judges had not discussed among themselves the specific indications of 
intellectual status as reflected on the B-G, since the test is not ordinarily used in this 
connection. Therefore it is not surprising to note that there was a general relation- 
ship between extent of experience with the test and the frequency of serious errors as 
defined above; the most experienced judge made two such errors, while the least ex- 
perienced made five. It was also noted that the most experienced judge was more con- 
sistent over all ranges of intellectual level and impairment, while the less experienced 
judges were more erratic, with some ranges more accurately estimated than others. 
However, Tables 3 and 4 do not indicate any impressive trend concerning overall 
accuracy and extent of experience. 

It can be concluded, therefore, that subjective judgments of intellectual status 
based on the B-G can be made with some consistency and accuracy by some judges; 
however, this is of such a small magnitude that clinicians (at least those involved in 
this study) should be discouraged from such a practice. 


SUMMARY 
This study tested the hypothesis that intellectual level and degrees of intellec- 
tual impairment could be subjectively estimated from B-G protocols. Using only the 
B-G productions of 100 psychiatric in-patients randomly selected according to pre- 
scribed criteria, three judges estimated these two intellectual factors for each case. 
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Inter-judge reliability was tested and accuracy of judgment was measured using S-H 
scores as a criterion for intellectual status. Some evaluations of seriousness of 
errors and length of experience of the judges were also made. 

It was concluded that the reliability and validity (though statistically signi- 
ficant for many of the judgments) is not considered to be of sufficient magnitude to 
merit subjective estimates of intellectual level and intellectual impairment from the 
B-G protocols alone, at least where individual cases are concerned; also, these judg- 
ments varied in some respects according to the experience of the judge. The hy- 
pothesis was therefore not confirmed (for practical purposes at least) for the three 
judges involved. Though objective B-G indicators of intellectual status may exist, 
this study has not pointed to any short-cut to their study compared with trial and 
error exploration with an objective scoring system. 
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EFFECT OF HYPOPHYSECTOMY ON BENDER-GESTALT 
TEST PERFORMANCE! 


MARTHA SCHON AND SHELDON E, WAXENBERG 


Sloan-Kettering Institute for Cancer Research 
Memorial Center, New York City 


PROBLEM 


Hypophysectomy, the surgical removal of the pituitary gland located at the 
base of the brain, is now performed in an attempt to induce a remission in patients 
with advanced cancer“). Dr. Bronson Ray of New York Hospital who performed 
the surgery on the subjects of this study uses a technique whereby the frontal lobe 
is retracted and the right olfactory nerve is sectioned in order to gain access to the 
pituitary gland“. Following hypophysectomy a deficiency state is created of all 
hormones which are not substituted for, since the pituitary gland, the master control 
organ of hormonal production, has been eliminated. 

Pascal’s standardized scoring system“? renders the Bender-Gestalt Test“ 
available as a tool for measuring the psychological effects of this surgical procedure 
and its hormonal sequelae. The usefulness of the instrument derives from the fact 
that the scoring has been devised so as to correlate with the seriousness of psychiatric 


1This study was supported in part by research grants M-884 and M-1118 from the National 
Institute of Mental Health, National Institutes of Health, U. 8S. Public Health Service. 

The authors wish to e oy their gratitude to Dr. Arthur M. Sutherland, Dr. Olof Pearson and 
Dr. Mortimer B. Lipsett of the Memorial Center for Cancer and Allied Diseases, New York City, 
for making possible the collection of data used in this study. 
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disorder for subjects of at least normal IQ between the ages of 15 and 50 who have 
had at least a year of high school education. Within these ranges, the scores are not 
related to sex, drawing ability, 1Q, or age. 

Under investigation here is the question of what psychological effects ensue from 
hypophysectomy. Will there be changes in the copying of the Bender-Gestalt de- 
signs by this group, all of whom were cancer patients, none of whom was known to 
have had a history of psychiatric referral or treatment? More particularly, will their 
quantitative scores change so as to resemble more closely the scores of psychiatric 
inpatients and outpatients or will they become more like the scores of nonpatient, 
so-called normals? 


SUBJECTS AND PROCEDURES 


The test was administered, as part of a psychological test battery, to each of 20 
postmenopausal women suffering from metastatic carcinoma of the breast within a 
few days immediately preceding hypophysectomy, and then again 6 to 10 weeks after 
surgery. These ward and private cancer patients ranged in age from 40 to 66 years, 
mean age 51 years. Preoperative full scale Wechsler-Bellevue 1Q’s ranged from 67 to 
126, mean 103, with five below an IQ of 90. Four of the patients graduated from 
college or almost finished college work. Five others graduated from high school. Four 
more completed at least one year of high school but did not graduate. The remaining 
7 women had 3 to 8 years of schooling, in several instances exclusively in their Euro- 
pean native lands. These wide educational differences are in some measure adjusted 
by Pascal’s two sets of standard scores, one for persons having had at least one year 
of high school and the other for persons with a background of at least one year of 
college. It must be noted, however, that there were patients with less education than 
this minimum among the group of subjects studied in this research, and their stand- 
ard scores were obtained from the high school level tables. 


The two authors of the Bender-Gestalt scoring manual? report a scoring re- 
liability of .90 between their individual scorings of a large set of records. The scorer 
of the hypophysectomy data (S. E. W.) obtained a product-moment correlation co- 
efficient of .89 in comparing his scores on a set of 20 sample records with the manual 
authors’ scores. 


RESULTS 


Before hypophysectomy, the Bender-Gestalt raw scores of the 20 patients 
ranged from 9 to 124, mean raw score 46.4. Postoperatively, the raw scores ranged 
from 6 to 146 and the mean raw score was 56.9, showing worsening in the test per- 
formance. A ¢ test on the mean difference ®) between the 2 sets of 20 scores yields a 
value of 2.53, significant between the .05 and .02 levels of probability. Standard 
scores, which help equate for educational differences, ranged from 40 to 162 with a 
mean of 81.5 before surgery and from 37 to 186 with a mean of 92.7 afterwards. This 
mean difference yields a t of 2.40, also significant between the .05 and .02 levels. Wil- 
coxon’s nonparametric method of paired replicates “’) gives the same significance level 
for the differences. Thus, it appears that hypophysectomy results in Bender-Gestalt 
performances with more distortions than preoperatively. 

Of the 10 patients who enjoyed remission of their disease, 3 improved their 
Bender-Gestalt performances while 7 obtained worse, that is, higher scores. Of the 
10 patients who did not experience remission, 3 received better scores, 5 received 
worse scores, and the scores of 2 did not change at all. 

Pascal’s scoring method assigns scores of 1, 2, 3, 4, 5 and 8 points for various dis- 
tortions in the copying of the designs. Those deviations which characterize hospital- 
ized psychiatric patients and which are infrequently encountered in the records of 
nonpatients are scored eight points each. Tabulation of eight-point deviation scores 
in the records of the 20 breast cancer patients revealed an increase from a total of 38 
for the group before surgery to a total of 49 after surgery. This change is not signi- 
ficant at the .05 level of probability by either ¢ test °) or Wilcoxon’s paired replicates 
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method“. Thus, specifically psychotic performance did not increase appreciably in 
frequency of occurrence. 

Pascal found that if a standard score of 67 was used as a cutoff point, there were 
theoretically only 5 chances in 100 of including a nonpatient record in a screening. 
This criterion was used to divide the present group into two categories, those re- 
ceiving standard scores below 67, who will be regarded as resembling Pascal’s non- 
patients, and those receiving standard scores above the cutoff point, who will be con- 
sidered to be similar to Pascal’s psychiatric patients. Preoperatively, 9 patients fell 
in the ‘normal’ category and 11 in the ‘psychiatric’ category. Postoperatively, 3 of 
the 9 ‘normals’ moved to the ‘psychiatric’ category. There were no shifts in the op- 
posite direction. A chi-square test, with the nonindependence of the two group classi- 
fications taken into account and with the correction for continuity applied, gives a 
value of 1.33, not significant at the .05 level. Thus, while a greater number of these 
patients than would have been expected on the basis merely of their physically ill 
status appeared to resemble psychiatric patients in accordance with this Bender- 
Gestalt screening, a significant additional number did not shift into the ‘psychiatric’ 
category after undergoing hypophysectomy. 


DISCUSSION 

Preoperative interviews indicated that the patients were more threatened by 
the hypophysectomy than by any previously experienced operations. The pre- 
operative similarity in Bender-Gestalt scores of the present group of subjects to 
psychiatric patients might be related to the fact that they were under tremendous 
emotional stress as they faced this critical surgery. 

Two major variables enter into this study. One is brain surgery, and the other 
is elimination of pituitary hormones. It is unlikely that the worsening of test per- 
formance is related to brain damage since those subtests on the Wechsler-Bellevue 
Intelligence Scales which are considered as diagnostic indicators of brain damage 
did not show any significant postoperative changes“. 

At the time the patients were retested after surgery they were deficient in all 
hormones except cortisone which was administered before, during, and after surgery 
as it is necessary for the maintenance of life. The extirpation of the pituitary gland, 
therefore, effected mainly a secondary hypothyroidism and an elimination of the 
gonadotropic and somatotropic hormones. Postoperative interviews revealed that 
feelings of increased nervousness and irritability were also present in the group of 
patients at the time clinical hypothyroidism was noted. Follow-up interviews in- 
dicated that after thyroid substitution therapy was begun these symptoms tended to 
disappear. It would seem, therefore, that the worsening of Bender-Gestalt per- 
formance may have been related to secondary hypothyroidism. 


SUMMARY 

The Bender-Gestalt test was administered to 20 women before surgery and then 
again 6 to 10 weeks after hypophysectomy for recurrent metastatic breast cancer. 
Pascal’s quantitative scoring technique was used to appraise changes in test perform- 
ance. 

Scores worsened significantly after surgery in the direction of more closely re- 
sembling those of psychiatric inpatients and outpatients. There was not a significant 
change in incidence after surgery of the sort of distortions which rather specifically 
reflect psychotic behavior on the test (8-point deviation scores). Application of a 
screening criterion presented by Pascal for differentiating ‘normal’ from ‘psychiatric’ 
records revealed a majority of the 20 research subjects to be in the ‘psychiatric’ 
category even before surgery ; however, there was no significant further shift into this 
category postoperatively. 

The poor initial performances are attributed at least in part to chronic disease 
status and threat of impending critical surgery. The changes observed after surgery 
are ascribed not to the surgical procedure itself but rather to resultant hormonal 
deficiency, most importantly to hypothyroidism. 
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READING FAILURE AS A FACTOR IN THE WAIS SUBTEST 
PATTERNS OF YOUTHFUL OFFENDERS 


E. E, GRAHAM AND D. KAMANO 
Colorado State Department of Education Galesburg (Ill.) State Research Hospital 


PROBLEM 


Research with unsuccessful readers (UR) has revealed a Wechsler-Bellevue 
Intelligence Scale (WBIS) pattern similar to that attributed to youths suffering from 
character disorders“: ? * 4 5), Graham“: 279-271) finding that the WBIS scattergram 
of the UR corresponds closely to the scattergram described by Wechsler“ for the 
adolescent psychopath, questioned whether young criminals are poor in reading due 
to learning difficulties and whether this, rather than the character disorders, might 
be partially responsible for the distribution of scores. The population of this study 
was educationally retarded but not in trouble with the law, and the profile seemed 
typical of the educationally retarded youth without regard to moral qualities. 

For the current study it was hypothesized that the ‘‘criminal’’ UR would pro- 
duce a “psychopathic profile” but that the “‘criminal’’ successful reader (SR) would 
not. The problem was to find a population of youths of average intelligence who had 
been committed to institutions because of crimes and to separate two groups of SR’s 
and UR’s. The next step was to examine these youths and determine: (a) if there 
was any marked dissimilarity in the WAIS profiles of ‘‘criminal’’ SR’s and UR’s; and 
(b) if there were any subtests in which both SR’s and UR’s were in like direction 
significantly deviant from the norm group. 


PROCEDURE 


Randomly selected inmates of a federal correctional institution were administer- 
ed the reading section of the Jastak Wide Range Achievement Test (WR) “. These 
young men had all been committed under the Federal Juvenile Delinquency Act. 
Persons familiar with commitment procedures know that juveniles are rarely, if 
ever, committed to a federal correctional institution the first time they appear be- 
fore the courts. It was assumed that the experimental subjects had for some time 
failed in social adjustment. The average educational achievement level of this in- 
stitution was 6.5 grades according to records of the Director of Education. 

Two groups were selected from the examined population. Group I consisted of 
UR’s who were defined as those inmates who failed to achieve more than .6 of their 
expected grade levels. Group II consisted of SR’s who were defined as those who 
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Tas_e 1. Composition oF EXPERIMENTAL Groups IN TERMS OF CRITERIA OF SELECTION 











Unsuccessful Successful Total 
Criteria Readers Readers Group 








Number 33 
Chronological Age (years) 17 .02 
Confinement (months) : 
Education Completed (years) 8.2 
W. R. Achievement Test: Reading Grade 5.3 
Per cent Reading Level Attained 45.64 
WAIS: Mean Verbal IQ 90 .33 
Mean Performance IQ 99.75 
Mean Full Scale IQ 93 .93 








TABLE 2. COMPARISON OF SUBTEST PERFORMANCES BETWEEN READING LEVELS 








Unsuccessful Successful 
Readers Readers 


Subtest Mean Rank LD. Mean Rank 





Information . : 9.94 7 

Comprehension 7.96 10.49 6 
Arithmetic 7; 10.86 2.5 

Similarities if ¢ : 9.47 11 
Digit Span 8.15 j 3.8 10.86 2.5 
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Vocabulary 9.69 
Digit Symbol 9.49 
Picture Completion : 10.54 
Block Design 10. ‘ oad 10.69 
Picture Arrangement 10.96 2.6 11.37 
Object Assembly 9.5! 6 9.77 
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*Significant beyond the .01 level of confidence. 


achieved to .8 or higher of expectancy. Thirty-three met the criteria for the UR and 
35 for the SR. The actual mean reading level of the UR was .4564 of expectancy; 
that of the SR was .8974. UR’s ranged from .22 to .60; SR’s from .80 to 1.20. In 
terms of reading grade levels, the UR’s ranged from 2.5 (second grade, fifth month) 
to 7.3 (seventh grade, third month); and the SR’s from 9.2 (ninth grade, second 
month) to 14.2 (sophomore, college). The subjects were all white. None were less 
than 16-0 years or more than 17-11 years. The UR’s reported having completed 
schooling to a grade level average of 8.2; the SR’s of 9.0. Both groups were admin- 
istered the Wechsler Adult Intelligence Scale (WAIS). 

In his early ‘? °- 2-155) analysis of the diagnostic and clinical features of his test, 
Wechsler indicated that the outstanding feature of the adolescent psychopath is his 
systematic high performance score as compared with his verbal test score. He points 
out that with the exception of Digit Symbol the performance subtests are above the 
mean. Information and Arithmetic are the tests most consistently low. He states 
that it is worthy to note that a “good score is frequently made by the psychopath on 
the Picture Arrangement test ...’’7 ®, Since the theoretical framework supporting 
each subtest remains the same, it is reasonable to assume that psychograms deter- 
mined by personality disorders should be reasonably constant whether the WBIS 
or the WAIS is used. 


RESULTS AND DiscussION 


The two groups differed significantly in 1Q’s as measured by the WAIS. The 
UR’s Full Scale IQ was 93.93 or 11.17 IQ points lower than the 105.1 of the SR. An 
actual difference in intelligence probably functioned to cause some of the discrep- 
ancy in reading achievement; however, it is probable that the difference in educa- 
tional achievement caused some of the disparity in IQ. The researchers do not be- 
lieve that matched IQ’s would assure that the groups were matched in intelligence. 
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They are likewise unsure that the significant difference in IQ reflects as great a bona- 
fide difference in basic intelligence. 


TABLE 3. ComMPaARISON oF UNsuccEessruL READERS AND WAIS SranparpizaTion Group AGED 
18-19 as TO MEAN ScALED ScoRES ON WAIS SustTests 
—-= eS 


Standardization 
Group 


Mean Diff. 








Information 9 2. 
Comprehension 96 55 9. 1. 
Arithmetic af 8 9.§ 1 
Similarities AS § 9. 

Digit Span 15 2.5% 9. 
Vocabulary ).38 46 9. 

Digit Symbol 

Picture Completion 

Block Design 

Picture Arrangement 

Object Assembly 


*Significant beyond the .01 level. 


TABLE 4. CoMPARISON OF SuccEssFUL READERS AND WAIS StTanpDARDIZATION Group AGED 18-19 
AS TO MEAN SCALED ScorEs oN WAIS Susrests 


Successful Standardization 
Readers Group 
Subtests Mean 8.D. Mean Diff. 


9. 





Information 9.94 
Comprehension 10.49 
Arithmetic 10.86 
Similarities 9.47 
Digit Span 10.86 
Vocabulary 9.69 
Digit Symbol 9.49 
Picture Completion 10.54 
Block Design 10.69 
Picture Arrangement 37 
Object Assembly 9.77 
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*Significant beyond the .05 level. 
**Significant beyond the .01 level. 


The SR’s attained scores superior to the UR’s in all areas. This difference was 
significant in the verbal tests and Digit Symbol. The SR’s scores were significantly 
superior to Wechsler’s standardization group in Comprehension, Arithmetic, Digit 
Span, Picture Completion, Block Design, and Picture Arrangement. 

The UR’s scores were significantly inferior to the standardization group on all 
verbal tests and Digit Symbol. They were superior to the standardization group on 
the remaining four performance tests. Both UR’s and SR’s did best on the Picture 
Arrangement test, corroborating Wechsler’s observation that this test is frequently 
high. 
Convicted youthful offenders who are able to read as well as the average person 
do not show marked differences in success on Wechsler verbal and performance 
tests. Similar individuals who are inadequate in reading demonstrate the profile 
which Wechsler described as typical of the adolescent psychopath. Their verbal 
tests are inferior to their performance tests. Certain questions are raised by these 
results. Is there a relationship between the dynamics underlying reading failure and 
those promoting delinquency or the development of character disorders? Do high 
Picture Arrangement scores reflect social intelligence or do they suggest an improved 
skill in reading the non-verbal communications of cartoonists as a result of long 
hours of isolation where tattered comic books are the only companions? 
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SUMMARY 


To test the hypothesis that youthful criminals who are able readers will not 
show the difference in verbal and performance subtests which has been described by 
Wechsler as typical of the youthful psychopath while criminals who are inadequate 
readers will produce such a psychogram, inmates of a federal correctional institution 
were divided by means of the Jastak Wide Range Achievement test into groups of 
successful and unsuccessful readers and were administered the Wechsler Adult In- 
telligence Scale. The 33 UR’s were found to be inferior both to Wechsler’s standard- 
ization group and to the 35 SR’s in verbal subtests and Digit Symbol. Their psycho- 
gram is similar to that described as typical of the youthful psychopath. The SR’s did 
as well on verbal as on performance subtests, and their psychogram does not re- 
semble that considered typical of the youthful psyc —— Both groups achieved 
the highest scores in Picture Arrangement. 
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MMPI PROFILE CONFIGURATIONS 
AMONG CRIME CLASSIFICATION GROUPS 


JAMES H. PANTON 


Reception Center, Central Prison 
Raleigh, N. C 


PROBLEM 
Despite the wide use of the Minnesota Multiphasic Personality Inventory 
(MMPI) with various clinical populations, there have been few studies concerned 
with the MMPI Profile configurations of criminal groups. Within the correctional 
setting, as in other clinical fields, it is of considerable value to have knowledge of the 
typical profile of the select group with which one is working. The purpose of this 
paper is to test the hypothesis that a prison population is more deviant than the 
general population in its MMPI responses and that there is a characteristic profile 

for each of six major crime classification groups. 


PROCEDURE 

The MMPI records on those male prison inmates with intelligence quotients 
above eighty, who were tested at the Reception Center, Central Prison, Raleigh, N. 
C., during the calendar years 1955-56 were screened for validity. Validity scale 
scores above a critical score of T = 70 were rejected except on the F scale where 
scores above T = 80 were disqualified.!. This procedure provided a total sample of 
1,313 “‘valid’’ test results. The sample was next sorted into sub-samples according 
to the type of criminal activity in which each member of the total sample had been 
involved. These crime classification groups were designated as follows: 
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White Collar Group (forgery, embezzlement, false pretense, fraud, etc.) 
Aggravated Assault Group (murder, assault, secret assault, etc.) 
Robbery—Burglary Group (house burglary, store robbery, highway robbery, etc.) 
Property Theft Group (larceny, larceny of automobile, etc.) 
. Aggravated Sex Group (rape, assault with intent to commit rape, etc.) 
6. Sex Perversive Group (homosexual and heterosexual) 


Diagnostic scale means and standard deviations were then computed for each crime 
classification group and for the total sample. K corrections were added to the appro- 
priate scale scores prior to calculation of the means. 


RESULTS AND DIscUSSION 


The diagnostic scale means, standard deviations, and the number of cases in 
each group are presented in Table 1. The relatively high elevation of the diagnostic 
scale means above the standard normative mean and the marked similarities of the 
group profiles are readily noticeable. The greatest single scale difference between any 
two groups is 7.6 T-score units, appearing between the aggravated assault and white 
collar groups on the Sc Scale. 


TaBLeE 1. T-ScorE MEANS AND STANDARD DEVIATIONS FOR CRIME CLASSIFICATION GROUPS AND 
Tota SAMPLE 


__Group Hs D Hy Fa ME Pa Se 











White Collar (N = 233) 
Mean 63.6 66 


3 62. 
8. D. b. 2.9 
N 


1 70.6 54.9 60.5 2.0 61. 
3 10.9 9.5 12.5 12.3 


9 
12 
*MMPI Code 4’21 37869-50. 


Aggravated Assault (N = 157) 
Mean 4.9 66.0 61.0 70.7 
8. D. i i: > a ae a 


MMPI Code 4 
Robbery-Burglary (N = 219) 
Mean 61.4 64.5 60.0 71.2 
S. D. oe 3.5.0.3 .O6 


MMPI Code 4’82 
Property Theft (N = 568) 
Mean 52.0 64.2 58.9 71.5 
S. D. 15. 29 11.5 10:3 
MMPI Code 4’8 27916-3 | 0 
Sex Perversive (N = 73) 
Mean 8 63.7 60.3 70.1 59.0 57.4 
S. D. 24 B26 64 W.t 0.7 1:8 
MMPI Code 4’28 173-596 0 
Aggravated Sex (N = 63) 
Mean 3 63.4 61.4 68. 
8. D. 5. 11.8 10.6 12. 
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3 
MMPI Code 48 1 
Total Sample (N = 1313) 
Mean 62.7 64.8 60.1 0 1.8 

8. D. 15.4 12.9 11.6 6 13.1 
MMPI Code 4’82 7196 3-50 


55.2 63.9 
8.6 12.7 
62 73-950 


55.1 6 
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*Welsh’s System of Coding 


Computation of mean differences and their f-ratios between each of the crime 
classification group means and between the means of each of the groups and the 
total sample means revealed that out of 210 cell differences there are only 23 differ- 
ences which appear significantly different at the .01 level of confidence.* The fre- 
quency with which each diagnostic scale functions as a discriminatory factor at the 
.01 level of confidence is as follows: Hs, D, Pd, Pt and Si: O; Hy: 1; Mf: 3; Pa: 5; 
Sc: 6; Ma: 8. Of the three .01 level discriminations scored by the Mf scale, the Sex 


*Data on mean differences and t-ratios appearing between all the possible group combinations 
may be secured from the author. 
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Perversive Group is a member of all three of the group pairs involved. The Mf scale 
T-score difference between the Sex Perversive Group mean and the total sample mean 
is 3.9 T-score units, which is the only Mf scale difference between crime classifica- 
tion group and total sample means that is greater than .7 T-score units. From these 
observations it appears that the Mf scale is functioning adequately when applied to 
prison groups. 

Table 2 presents the number of scales with T-score means appearing signifi- 
cantly different at the .01 level among the various group combinations. This sum- 
mary indicates that the Sex Perversive Group, with twelve scale differences, is the 
most deviate profile. However, the mean scale difference of this group is only 2.00. 
The mean scale difference of the total sample is only 1.00 which compares favorably 
with the mean scale difference of 1.07 appearing between any possible combination of 
groups. 


TaBLe 2. NuMBER OF SCALES APPEARING DIFFERENT AT THE 1% LEVEL OF CONFIDENCE AMONG 
Group CoMBINATIONS 


White Aggravated Robbery Property Sex Aggravated Total 
Collar = Assault Burglary Theft Perversive Sex Sample 
N=233 N=157 N =21 N=568 N=73 N =63 N =1313 
White Collar — 1 
Aggravated Assault 











Robbery Burglary 
Property Theft 
Sex Perversive 
Aggravated Sex 
Total Sample 
Total 

M Difference 


0 
0 
2 
0 
1 
4 





Note: Total Scales Different = 45 
Mean Scale Difference between any possible combination of groups = 1.07 





It appears evident from the data presented in Table 2 that there is no marked 
difference between the various group profiles. However, all group profiles, as well as 
the total sample profile, appear significantly different from the standard normative 
profile (M = 50, SD = 10). Computation of f-ratios between crime classification 
group scale means, Total Sample scale means, and the Standard Normative means 
reveals that all but two of these means are significantly different at the .01 level of 
confidence. The Aggravated Assault group and the Aggravated Sex group have 
differences on the Si scale which are significant at the .05 level. 

An examination of the coded profiles (Welsh’s System of Coding)? presented in 
Table 1 lends support to our findings that there is a close similarity between group 
profiles. All the profile codes would be classified ‘‘4 prime with 0 low profiles’’.? From 
these profiles one might conjecture the mean personality type of the prison popula- 
tion to be capable of description as follows: irresponsible, impulsive, undependable, 
tactless, egocentric, deficient in ability to calculate own stimulus value, incapacity 
to evaluate the consequences of their behavior, and a marked facility in rationaliza- 
tion. This description does, in fact, fit the most common clinical picture observed 
among prison incarcerants during their initial classification processing. 


SUMMARY 
_A study of the MMPI responses of 1,313 prison inmates revealed a distinct 
prison population profile which may be employed to give added knowledge of prison 
population variations in personality profiling as compared to the profile of people in 
general. However, there appears to be no marked difference between the profiles of 
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six major crime classification groups. Even though several of the diagnostic scales 
discriminate at the .01 level of confidence between various crime classification group 
combinations, none of these discriminations are of such frequency or magnitude to 
warrant the use of separate crime classification profiles. 

In the clinical employment of the prison population profile it is not suggested 
that the profile be substituted as a “normal’’ prison profile from which to measure 
individual deviations, but it is hoped that it might be of some value in providing 
additional insight into the dynamics of prison inmate behavior as well as an aid in 
future research on the use of the MMPI in correctional settings. 
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PREDICTING PRISON ADJUSTMENT WITH THE 
MINNESOTA MULTIPHASIC PERSONALITY INVENTORY 


JAMES H. PANTON 


Reception Center, Central Prison 
Raleigh, North Carolina 


INTRODUCTION 


An important need, in terms of the most efficient utilization of prison rehabilita- 
tion programs within the framework of sufficient custody, is an objective technique 
to assist in initially identifying newly admitted prison inmates who, for various 
reasons, will become non-adjusted to controlled norms of behavior. 

In the latter part of 1956 an attempt was made at this institution to develop a 
prison adjustment scale from the MMPI based on the hypothesis that propensity for 
non-adjusted prison behavior, apart from escape, derives from a constellation of 
personality factors which are not identified by any single scale, or configuration of 
scales, on the MMPI.“ The Chi-Square technique was employed to determine the 
significance of the difference in which a sample of 56 non-adjusted male prison in- 
mates responded to the 566 items in the booklet form of the MMPI as compared to 
the response of 72 adjusted male inmates. The non-adjusted sample was designated 
the “infraction group’’, and was comprised of inmates who had committed two or 
more infractions of a serious nature, apart from escape. The adjusted sample was 
designated the ‘‘non-infraction group’’, and was comprised of inmates whose records 
were free from infractions; however, escapists were included in this sample if their 
only misconduct had been escape. Forty-two items were responded to significantly 
different beyond the .05 level of confidence by the two groups. In scoring the items, 
one point was allowed for each item answered in the direction in which the infraction 
group differentiated itself from the non-infraction group. The optimal cutting score 
between the raw score distributions of the two groups was established at a score of 
16, which served to identify 80.6% of the non-infraction group, and 82.2% of the 
infraction group. Cross validation was determined on 100 inmates who had com- 
mitted more than one serious rule infraction on prior sentences as compared to a 
group of 100 inmates with records of no infractions committed on prior sentences. 
Application of the 42 item scale to these validative groups served to distinguish 76% 
of the non-infraction group, and 75% of the infraction group. However, further 
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validation performed in November, 1957, with groups selected under the same cri- 
terion identified only 53% of the non-infraction group and 60% of the infraction 
group. The difference between the means of these two groups was not statistically 
significant. 

The purpose of the present study is to refine the 42 item scale and test its vali- 
dity on additional groups that meet with what is administratively considered ad- 
justment or non-adjustment, other than escapism. 


PROCEDURE 


The following items were eliminated from the 42 item scale for the reasons 
stated: 
Item 295—few inmates have ever heard of ‘Alice In Wonderland’, and on the item analysis it 
was left unanswered more than any other item. 
Item 310—a repetition of item 20. 


Items 2, 42, 31, 380—all four of these items appear on the Prison Escape Scale developed at this 
institution in 1955-1956.@)* Since escapists in the infraction group outnumbered escapists 
in the non-infraction group by a ratio of 2 to 1, it was felt that personality variables related 
to escapism had contributed heavily to the empirical selection Z these items. 


The remaining 36 item scale was redesignated the Prison Adjustment Scale 
(Ap). Table 1 lists the 36 items and their direction of response in favor of non- 
adjustment. 

TaBLeE 1. 36 MMPI ITems ANSWERED SIGNIFICANTLY DIFFERENT BEYOND THE .05 


LEVEL OF CONFIDENCE BY 72 ADJUSTED AND 56 Non-ADJUSTED PRISON INMATES 
(ORIGINAL EXPERIMENTAL AND ConTROL GROUPS) 
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The male subjects for validating the scale were randomly selected and consisted 
of (a) 77 adjusted inmates who had not committed a rule infraction nor escaped 
since their admission to prison during the last four months of 1955, (b) a group of 100 
adjusted inmates admitted during 1956, (c) 77 non-adjusted inmates who had com- 
mitted one or more serious rule infractions other than escape (fighting, assault on a 
guard, rioting, refusing to obey orders, etc.) since their admission in the last quarter 
of 1955, (d) a similar non-adjusted group of 100 inmates admitted during 1956 and 
finally, (e) a carefully chosen sample of 27 severely non-adjusted inmates who had 
committed five or more serious infractions while serving on their current sentences. 
All these groups had been administered the MMPI at the time of initial reception. 
Only valid MMPI records were considered in the selection of these cases. Profile 
sheets with validity scale scores above T = 70 were rejected except on the F scale 
where scale scores above T = 80 were disqualified. “ 

A comparison of the MMPI profiles of all the groups revealed a marked similar- 
ity, indicating that the personality variables found to differentiate between adjusted 
and non-adjusted prison behavior are determined independently of any scale or 
group of scales on the MMPI. 


*It is suggested by the author that the prison escape scale be known as the Ec Scale. 
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RESULTS AND DISCUSSION 

Table 2 presents the comparison of the raw score frequency distributions of the 
validation groups on the Ap scale. The point of greatest dichotomy is established at 
a cutting score of 13, which serves to identify 82 percent of both of the adjusted 
groups, and 87, 85 and 93 percent of the non-adjusted groups respectively. It is 
obvious from simple inspection of the table that the scores of the non-adjusted cases 
are distributed decidedly higher than the scores of the adjusted groups. A total of 67 
of the non-adjusted cases are beyond the highest score of any adjusted case. 


TaBLe 2. FREQUENCY DISTRIBUTION oF Raw Scores oF ADJUSTED AND Non-Apsustep Groups 
on 36 ITEM Ap ScALE 


























Adjusted Groups | Non-Adjusted Groups 
Raw 1955 1956 | 1955 1956 Problem 
Score N = 77 N=100 | N=7 N = 100 N = 27 
26 1 
25 
24 1 1 
23 1 
22 3 2 
21 3 1 
20 4 3 2 
19 5 7 
18 11 7 1 
17 5 6 3 
16 1 1 7 14 2 
15 3 3 11 16 2 
14 6 9 10 8 6 
13 4 5 13 16 5 
" (18%) (18%) | (87%) (85%) (93%) 
(82%) (82%) (13%) (15%) ( 7%) 
12 15 21 3 6 
11 14 14 3 3 1 
10 14 11 1 2 
9 + 15 3 1 1 
8 10 12 2 
7 3 7 
6 2 1 1 
5 1 1 











*Point of Greatest Dichotomy 


Table 3 gives a comparison of the raw score means and standard deviations. The 
difference between the means are all significant beyond the .01 level of confidence. 


TaBLe 3. CoMPARISON OF MEANS AND STANDARD DEVIATIONS 














N M SD M Diff. CR F 
1955 Groups 
Adjusted 77 10.68 2.33 
4.58 10.83 <.01 
Non-Adjusted 77 15.26 2.91 
1956 Groups 
Adjusted 100 10.59 2.30 
4.78 11.95 <.01 
Non-Adjusted 100 15.37 3.33 


Problem Sample 
Non-Adjusted 2 
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Age is considered an important criterion in the segregation of prison inmates; 
therefore, it was considered desirable to examine the predictive accuracy of the Ap 
scale at various age levels. Both adjusted groups and the 1955 and 1956 non-adjusted 
groups were combined and then subdivided into the age groups shown in Table 4. 
The skewness of the age distribution is in the anticipated direction as the frequency 
of non-adjusted prison behavior is greater among younger inmates. Although there 
is some shrinkage in the predictive accuracy for the adjusted 16-21 year old group 
and a more marked shrinkage among the non-adjusted 31-43 year old group, no 
special significance should be attached to these differences without further study. 
The means and standard deviations of the various age groups remain consistent and 
there appears to be little justification for the introduction of an age-correction factor. 


TaBLe 4. PrepictivE Accuracy or Ap ScaLE By Acre Groups 











Percent 

Combined Groups (by age) N M SD Identified 
Adjusted (N = 177) 

16-21 35 11.06 2.45 79% 

22-30 70 10.83 2.28 83% 

31-53 72 10.22 2.22 83% 
Non-Adjusted (N = 177) 

15-21 88 15.38 3.06 88% 

22-30 60 15.73 3.25 87% 

31-43 29 14.83 3.14 80% 








The justification for the existence of any predictive instrument is based upon 
whether or not it is able to improve on the predictive criterion already in use. Table 
5 presents the comparison of the predictive accuracy of the Ap scale with the clinical 
prognoses rendered on the 1956 adjusted and non-adjusted groups and the problem 
sample at the time of initial admission. These statements of prognoses were made in 
reference to non-adjusted behavior which is demonstrated by flagrant violations of 
prison rules usually marked by overtly aggressive behavior. They do not reflect pre- 
diction of maladjustment in the sense of functional or constitutional disorders which 
do not manifest themselves through “‘acting out” behavior. It is noted from Table 
5 that the Ap scale does not improve on the prognoses for the adjusted group; how- 
ever, the scale shows an improvement of 26 percentage points over the prognoses for 
the non-adjusted group. The improvement of the scale over the prognoses for the 
problem sample is 15 percentage points. Both of these differences are highly signi- 
ficant. 


TaBLe 5. Accuracy OF PREDICTION - CLINICAL PROGNOSIS 
CompaRED WitH Ap ScALe 








1956 Groups Problem 
Adjusted Non-Adjusted Sample 
N = 100 N = 100 N = 27 





Predicted Non-Adjusted - Prognosis 17% 59% 78% 
Predicted Non-Adjusted - Scale 18% 85% 93% 


Cutting Line 
Raw Score of 13 


Predicted Adjusted - Prognosis 83% 41% 22% 
Predicted Adjusted - Scale 82% 15% 7% 








The test-retest correlation for an adjusted sample of 44 cases retested within an 
interval of a few weeks to more than a year was .86. The intercorrelation with the 
Ec Scale was .04 for the combined adjusted groups and .28 for the combined non- 
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adjusted groups, indicating that the Ap Scale functions with considerable independ- 
ence of the personality variables which are characteristic of escape ‘‘proneness’’. 


TaBLE 6. T-ScorE ConversION TaBLE For Tue Ap Sca.e Basep on 200 

















Raw Score T-Score Raw Score T-Score 
36 155 17 76 
35 151 16 72 
34 147 15 78 
33 143 14 63 
32 138 13 59 
31 134 12 55 
30 130 11 51 
29 126 10 47 
28 122 9 42 
27 118 8 38 
26 113 r J 34 
25 109 6 30 
24 105 5 26 
23 101 4 22 
22 97 3 17 
21 93 2 13 
20 88 1 9 
19 84 0 5 
18 80 

CONCLUSIONS 


1. An attempt has been made to improve on the validity of a 42 item MMPI 
prison adjustment scale developed at the North Carolina State Prison in 1956. After 
reducing the scale to 36 items it was redesignated the Ap Scale. It was applied to 
three additional validative samples selected from the 1955 and 1956 prison admis- 
sions. 

2. The scale correctly identified 82 percent of the adjusted groups and 87, 85 
and 93 percent of the non-adjusted groups respectively. 

3. The scale appears to function independently of the diagnostic MMPI scales 
and the Prison Escape Scale (Ec). It appears equally applicable to youthful offender 
groups as well as to older, more settled segments of the prison population. 

4. The predictive accuracy of the Ap Scale showed no improvement over the 
predictive accuracy rendered by clinical statements of prognoses when applied to 
adjusted groups; however, when applied to non-adjusted groups the scale showed a 
considerable improvement over the prognoses rendered at the time of initial ad- 
mission to prison. 

5. The scale can be utilized as an objective aid in the complex planning of re- 
habilitation and custody programs for newly committed prison inmates. However, 
the author suggests that it should not serve as the sole basis for categorized recom- 
mendations to place or not to place certain individuals under maximum security. 
Any prognostic assertion on the basis of the scale snould be supported by other in- 
dices of prison classification. 
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SECOND-ORDER PERSONALITY FACTORS AND THE MMPI 
SAMUEL KARSON 


University of New Hampshire 


In a recent paper Karson and Pool “® reported a correlation matrix between and 
among the MMPI scales and the 16 P. F. test factors. Now that a second-order 
factor analysis of the 16 P. F. matrix has been accomplished “", some of the informa- 
tion previously reported seems worthy of further study. The purpose of the present 
paper is to attempt to specify more precisely what the MMPI scales are measuring 
in a non-psychotic, outpatient, clinical sample referred for psychological evaluation. 
To facilitate the presentation, Table 1 has been prepared from the intercorrelation 
matrix reported earlier “°, 


Tas Le 1. Pearson CorRELATIONS BETWEEN MMPI Scatgs anp 16 P. F. Factors (N = 71) 








| 
Sixteen P. F. 





MMPI Sixteen P. F. MMPI 

Scale Factor Factor Scale 

? Qs" A Si+ 

L B*+ C L*¥+ O*+ Q4+ B Mf* L*+ 

F C+ I*LM O QI* Q3+ Q4 C LK F+ Hs+ D+ Pd+ Mf+ 
Pt+ Sc+ Ma*+ Si+ 

K C H* I* + L+ M+ 0+ Q3 Q4+ E Mf+ Sc*+ Si*+ 

Hs C+ IL*M 0 Q3+ Q4 F D*+ Mf*+ Ma* Si+ 

D C+ F*+ H*+ 0 Q4 G 

Hy N* O* Q1* Q4* H K* D*+ Mi-+ Pt+ Sc*+ Si+ 

Pd C+ L M* O Q3*+ Q4 I K*-+ F* Hs Mf Pt Sc 

Mf B* C+ E+ F*¥+ H+ILMOQI* L K-+ F Hs* Pd Mf Pt Se 

Q2* Q4 Ma’ Si L*+ 

Pa M K+ F Hs Pd* Mf Pt Sc Si 

Pt C+ H+ILM0Q3+ Q4 N Hy* 

Se C+ E*+ H*+1LM0Q3+ Q4 O K+ F Hs D Hy* Pd Mf Pt 
Sc Si L*+ 

Ma C*+ F* L* Q3*+ Ql F* Hy* Mf* 

Si A+ C+ E*¥+F+H+LM0Q2Q4 Q2 Mf* Si 

Q3 K F+ Hs-+ Pd* + Pt+ Se+ Ma*+ 
Q4 K+ F Hs D Hy* Pd Mf Pt 

Sc Si L+ ?* 








*Significant at 5% level. All others are significant at or beyond the 1% level. 
+Denotes a negative correlation. 


Consideration will be given first to the so-called ‘‘validity scales’ of the MMPI. 


The following correlations were previously reported“°) among these four scales: 
K and F = -.52, K and L = .32, K and? = —.30; F and L = —.19 and F and ? = 
.18; Land ? = .04. Table 1 presents the significant correlations which K shares with 
the 16 P. F. test factors. Of special interest is the fact that every one of these parti- 
cular 16 P. F. factors loads significantly on the second-order personality factor of 
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Anxiety-vs.-Dynamic Integration which has been identified by Cattell and re- 
plicated by Karson and Pool“. Moreover, K does not correlate significantly with 
any of the other factors from the 16 P. F. These observations suggest the hypothesis 
that KX is measuring essentially the same thing that is being tapped by these other 
factors, namely, the second-order questionnaire factor of Anxiety-vs.-Dynamic 
Integration. Moreover, this interpretation of what K is measuring appears to lend 
stronger theoretical support to Fricke’s statement “: »- **) that “it would be expected 
that the r’s of IK with the full scales for D, Hy, Pd, Pa, and Ma would be more positive, 
or less negative, in a normal than in an abnormal sample” than his contention that 
K operates as a measure of the response set to answer false. In this connection it is of 
interest to note that the correlations among these scales reported by Karson and 
Pool“® are similar to the findings of Fricke although only the first three of his five 
expectations were fulfilled. The broader meaning given K may hold some promise for 
unifying its various interpretations as a validity scale®, a measure of defensive- 
ness), a measure of social desirability “*’, and as a response set. In this respect, 
Cattell’s comment © P- 1) to the effect that personality traits may well be deter- 
mining such response sets, seems to be worth pursuing. 

Essentially the same conclusion appears to be true for the F scale, namely, that 
it is largely a measure of the same second-order questionnaire factor as K, except, of 
course, that it taps the opposite pole. F correlates significantly with all of the major 
16 P. F. factors which load most significantly on the second-order factor of Anxiety- 
vs.-Dynamic Integration, namely, Q3, C, L, O and Q4. Moreover, this pattern has 
been replicated in at least three independent studies®’. The correlation of F with 
Q1, which reaches the 5 per cent level of significance, suggests that the F scale may 
also have a weak loading on still another second-order factor. 

The case for the L scale is certainly not as strong as that for K and F as evi- 
denced by its correlations with the 16 P. F. factors shown in Table 1. The correla- 
tion with intelligence has been frequently reported in the literature. Here again the 
evidence suggests that L is measuring essentially the same thing as K and F, namely, 
the second-order anxiety factor, although its loading on this factor is undoubtedly 
not as high as that of K or F. It is also worth noting that K, F and L each has its 
highest correlations among the MMPI scales with Pt and Se. 

What the ? scale is measuring is not clear. Karson and Pool“ report that it 
correlated significantly only with Q4 although its relationships with O and L ap- 
proach significance. Its low positive correlations with E and M, together with those 
already mentioned, suggest that it may be at least partially a weak measure of the 
two major second-order questionnaire factors found to date, namely, Anxiety-vs.- 
Dynamic Integration and Extraversion-vs.-Introversion. 

Since Pt and Se correlate very highly they will be considered together. The ex- 
tremely high correlations“® of each of these scales with the 16 P. F. factors loading 
the second-order factor of Anxiety-vs.-Dynamic Integration strongly suggest that 
both these scales have high loadings on this factor. To a lesser extent the evidence 
suggests that Sc and Pt also provide a much weaker measure of Extraversion-vs.- 
Introversion. 

Hs and D also show certain similarities in their patterns of intercorrelations 
with the 16 P. F. factors. Each appears to be essentially a measure of the second- 
order anxiety factor and perhaps to a lesser extent, of Extraversion-Introversion, 
also. 

Hy is apparently a factorially complex scale since it shows low but significant 
correlations with 16 P. F. factors which load on two second-order factors, anxiety 
and another factor identified by Cattell ®’ as Unbroken Success-vs.-Frustration. 

The pattern of correlations shown by Pd suggests that it is essentially a measure 
of Anxiety-vs.- Dynamic Integration. 

The Mf scale is seen as extremely complex since it correlates significantly with 
twelve factors from the 16 P. F. This pattern of relationships suggests that it loads 
significantly all three second-order factors mentioned previously and possibly two 
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others identified as Sociopathic Deviancy and Latent Homosexuality (Passive) by 
Karson and Pool, 

Little new information is provided for the Pa scale since it has no significant 
correlations with any of the 16 P. F. factors although it approaches significance with 
B, E, Q3 and Q4°. For the present sample which did not include any psychotic 
patients, none of the correlations between Pa and the other MMPI scales reached .4. 

The correlational pattern of the Ma scale suggests that it loads the second-order 
factor of anxiety primarily, and extraversion to a lesser extent. 

Finally, the complex pattern of correlations of the Si scale suggests the hypoth- 
esis that it measures both major second-order questionnaire factors and possibly 
another identified as Independence-vs.-Dependence“?, 

In summary, the purpose of the present study has been to attempt to show 
that through the technique of factor analysis newer and perhaps more fruitful 
hypotheses concerning what the MMPI scales are measuring can be generated.' The 
major conclusion reached was that the K, F and ? scales of the MMPI all appear to 
be measuring the second-order questionnaire factor of Anxiety-vs.-Dynamic Inte- 
gration. 
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A VALIDATION OF MMPI SCALE PAIRS IN PSYCHIATRIC DIAGNOSIS! 
EARL 8. TAULBEE 
VA Mental Hygiene Clinic, Omaha, Nebraska 


In a recent study Taulbee and Sisson“) reported 16 MMPI scale pairs? which 
significantly differentiated two criterion and three validation groups of neurotic and 
schizophrenic patients. Cut-off scores were presented which identified the patients 
at a very high level of confidence, and better than five judges did. The findings were 
discussed with respect to differential diagnosis and behavior processes. The purpose 
of this study is to provide additional data on the diagnostic validity of these scale 
pairs by applying them to new groups of psychoneurotic and schizophrenic patients, 
using the cut-off points reported by the authors. 


PROCEDURE 
The subjects used in this study were all VA patients.* There were 43 neurotics 
from a mental hygiene clinic, 27 neurotics from a general hospital, and 43 schizo- 
phrenics from a neuropsychiatric hospital. The MMPI profile of each patient was 
examined to determine the number of scale pairs present. The profile was then 
classified as being that of a neurotic or schizophrenic, or was considered to be “‘in- 
determinate’’, depending upon whether the total number of scale pairs fell within 

the range of 13 - 16, 0 - 6, or 7 - 12, respectively. 


RESULTS AND DiscussION 
Using the cut-off points suggested by the authors, the results revealed that the 
70 neurotics were classified as follows: 53 (75.7%) as neurotic, 1 (1.4%) as schizo- 
phrenic, and 16 (22.9%) as indeterminate. Of the 43 schizophrenics, 17 (39.5%) were 


classified as schizophrenic, 3 (7%) as neurotic, and 23 (53.5%) as indeterminate. 

These results confirm the validity of the scale pairs in differentiating between 
neurotics and schizophrenics. The primary difference between the results of the two 
studies was the decrease in the percentage of correct diagnosis of the schizophrenics. 
A very probable explanation for this is that many of the schizophrenics in the present 
group were administered the MMPI during or subsequent to the termination of a 
course of somato- or chemotherapy (ECT, IST, tranquilizers). The schizophrenics 
used in the original study were almost entirely new admissions and were tested prior 
to receiving treatment. 


SUMMARY 


In a previous study Taulbee and Sisson reported 16 MMPI scale pairs which 
significantly differentiated between groups of neurotics and schizophrenics. The 
present study presents the results of the application of these to new groups of 70 
neurotics and 43 schizophrenics. The findings offer further evidence for the validity 
of the scale pairs in differentiating between neurotics and schizophrenics. 
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1From the Veterans Administration Mental Hygiene Clinic, Omaha, Nebraska. 

*The scale pairs are: Hs greater than Hy, Pd, Mf, Pa, Pt, Sc, and Ma; D greater than Pd and 
Pa; Hy greater than Pd, Mf, Pa, and Ma; and Pt greater than Mf, Pa, and Sc. 

3Grateful acknowledgment is made to Dr. L. S$. Abramson and Mr. J. R. Donoghue for furnishing 
the MMPI profiles of the hospitalized schizophrenics and neurotics, respectively. 





RELIABILITY-STABILITY OF SUBTEST SCATTER ON THE 
WECHSLER-BELLEVUE INTELLIGENCE SCALES 


RICHARD M. GRIFFITH AND ROY 8S. YAMAHIRO 


Veterans Administration Hospital, Lexington, Kentucky and the University of Kentucky 


PROBLEM 
Clinicians citing evidence for or against the diagnostic validity of patterns of 
subtest deviations on the Wechsler-Bellevue Intelligence Test have never established 
one prerequisite for validity, the reliability and /or stability of the scatter. The re- 
liabilities of the separate subtest scores, determined in several studies’, do not 
guarantee consistency in relationships between subtest scores. 


MeETHOD 

The files of the Clinical Psychology Service yielded 55 records of patients who 
had been tested and retested after an interim period of at least one year with any 
oi the three ferms of Wechsler’s scales (WB 1, WB II, WAIS). At the time of first 
testing, the pai. nts ranged in age from 20 to 62 years, with a mean of 30.6. Their 
full-scale 1Q’s ranged between 61 and 131, averaging 92.8 compared with the mean, 
for all patients tested in the hospital, of 93.6, as determined from a sample of 55 
additional records chosen randomly. Diagnoses were varied: Schizophrenia, 24; 
Personality Disorder, 11; Chronic Brain Syndrome, 9; Neurosis, 6; Other, 5. The 
group seems fairly representative of the hospital population, with the exception that 
it included, by selection, only “chronic” patients who had remained in the hospital 
for at least one year or who had been readmitted. The interval between test and 
retest was in some cases as long as 10 years; the average was 42 months. The rank- 
order coefficient of correlation (rho) between the 11 subtest scores for test and retest 
was calculated for each of the patients. 


RESULTS 
Rho’s ranged from —.30 to +.96 with an average of +.51. Only three of the 55 
rho’s were negative; 48 were greater than +.25. 


Effect of form of scales. For ease in reporting results, the patients are classified in- 
to two groups according to the form of the scales used in test-retest. In Group A, the 
patient received the same form of the scale on retest; in Group B, a different form. 
Group A (N = 32) consisted of 30 cases tested twice with the WBI and 2 cases 
tested twice with WBII. Group B (N = 23) consisted of 8 cases tested first with WBI 
and later with WBII, and 15 cases tested first with WBI and later with WAIS. The 
average rho for Group A (.60) significantly exceeded the rho for Group B (.40) at 
the .02 level, as computed by the Mann-Whitney U Test. 


Effect of length of period between test-retest. For Group B, the length of the 
interim period (it should be remembered that the minimum was one year) did not 
seem to matter. However, within Group A, profile similarity dropped with increased 
time (Table 1). 

Tasie 1. AVERAGE RuHo IN RELATION TO INTERVAL 
3ETWEEN Test-Retest (Group A) 


Months N Rho 


12 to 24 11 1° 
24 to 36 10 57 
36+ 11 .500 


*First and last groups differ at .05 level by U test. 
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Rho in relation to diagnosis. Schizophrenics, who are often said to be character- 
ized by variability of performance, actually had slightly higher rho’s. No significant 
differences were found. 


Correlations “‘between’”’ individuals. The rho was computed between the test 
scores of each patient and the retest scores of the patient who was next in alpha- 
betical order. Group A and Group B were treated separately; Group A, for which the 
test and retest scales were the same, was, oddly enough, the lower of the two, but the 
difference did not approach significance. The average rho for both groups combined 
was +.16. There were 19 negative and 36 positive values, a dichotomy which differs 
at the .02 level (Chi square) from the 50-50 split expected if there were no group 
profile similarities. Thus, the correlations previously determined in part may be at- 
tributed to group rather than individual factors, a condition which would reduce the 
diagnostic value of profile patterns. However, the average relationship of +.16 
compares with the +.51 above; the rho’s “within” individuals are significantly 
larger then the rho’s “‘between”’ individuals, beyond the .01 level (U test). 


Reliability of IQ and practice effects. Since the interval between test and retest 
is longer in this study than in most others, the values for the reliability of the full- 
scale 1Q are included: For Group A, Pearson r = +.89; for Group B, r = +.85. In 
Group B, the practice effect would seem negligible, at least over these long time 
spans. When the same form of the test was readministered (Group A), the practice 
effect apparently persisted for some time, for the number of gains in IQ on retest 
outnumbered the losses 19 to 9 (4 being equal). This division is just shy of signifi- 
vance at the .05 level (Chi square) by the two-tailed test, comfortably significant 
by the one-tailed. A regular trend lends credence to the relationship: Patients re- 
tested after one to two years averaged an increase of 5.5 1Q points; after two to 
three years, 3.2 points; beyond three years, 0.8 points. 


DIscUSSION 

Over periods, the length of which demanded stability of performance aside 
from original reliability, neuropsychiatric patients were characterized by their sub- 
test patterns. The strength of the relationships are more striking when one allows 
for examiner influence and takes into account that at least some of the factors of 
diagnostic significance (e.g., anxiety level) may be transitory in nature. Actually, 
one might make a case that—if it stood the test of similar studies—the highest value 
found above (.71) for those who were readministered the same form after one to two 
years) is lower than the true value. All factors would decrease and none increase the 
size of the relationships. It may be concluded that, whether the patterns of devia- 
tions do or do not have personality or psychodiagnostic validity, the reliability is 
such that they might have. 


SUMMARY 

The rank-order coefficient of correlation between subtest scores was computed 
for each of a group of hospitalized veterans who had been retested with one of the 
various forms of Wechsler’s intelligence scales after periods extending from one to 
10 years. Rho’s for all subjects averaged +.51. Both the form of the retest scale 
relative to the form of the test scale and the length of interval between test and 
retest influenced the degree of profile reliability. A slight, though not nearly as large, 
tendency for profile similarity between members of the group was found. 
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DENTAL CARIES AND PSYCHOLOGICAL FACTORS 
JOHN H. MANHOLD, JR. AND A. JACK HAFNER 


Seton Hall College of Medicine and Dentistry Washington University School of Medicine 


PROBLEM 


Application of the psychosomatic concept to dental pathology has been the sub- 
ject of speculation for many years. This was not empirically investigated in a 
“normal” population until 1949 and since that time there have been a number of 
studies in this area“: > ® 7. 8 1), This work has led to the formulation of the follow- 
ing hypotheses: ‘‘Psychosomatic affection, as applied to dentistry, is either the re- 
sult of neurotic processes leading to phys®logical functions or dysfunctions which 
sause the pathological condition, or is resultant of ordinary neglect which is itself 
directly dependent upon traits of personality.’ ». 13%) 

The purpose of the present study was to investigate the psychosomatic concept 
as applied to a population of individuals with emotional difficulties. It was hy- 
pothesized that a group of individuals receiving out-patient psychiatric treatment 
should have a higher incidence of dental pathology than individuals not receiving 
psychiatric treatment. It was also hypothesized that the individuals whose emotional 
problems were of a chronic nature would have a higher incidence of dental pathology 
than those whose emotional] problems were acute in nature. 


PROCEDURE 


A visual and roentgenographic examination was made of the mouths of all avail- 
able white out-patients under psychotherapeutic treatment at the Psychiatry Clinic 
of the Washington University Clinics in St. Louis. Eighty-one patients were ex- 
amined during the period of the study with a mean age of 35.6 years. The same ex- 
amination procedures were employed with a control group of 81 white out-patients 
selected randomly from other medical clinics of the Washington University Clinics. 
The mean age of this group was 39.4 years. From the oral examination of each 
patient in the two groups a DMF Teeth rating ®’ was determined, the DMF Teeth 
rating being an accepted measure of dental pathology. 

An additional control group was also employed which was comprised of 1580 
white 18 and 19 year old subjects from the Greater St. Louis area. In order to adjust 
for the difference in age level between this group and the psychiatric group’s mean 
age of 35.6 years, as age and dental pathology are significantly correlated, the mean 
DMF Teeth for this control group was adjusted upwards by a value of .32 teeth per 
year. This correction figure was based on an estimate of St. Louis public health 
officials. The average yearly increase in D MF Teeth of .32 is consistent with findings 
in other populations studied in this country.! 

The psychiatric out-patient group was also divided into a chronic and acute 
group with chronicity being defined as the presence of the major psychiatric sym- 
tomatology for one year or more. In addition, a clinical diagnosis was made for each 
patient using psychiatric screening interview materials. This diagnosis was made in 
an attempt to differentiate between the various psychiatric sub-groups on the basis 
of dental pathology. 


RESULTS 


A critical ratio test was the statistic used to make comparisons between the 
psychiatric and control groups. The mean DMF Teeth rating for the psychiatric 
out-patient group was significantly greater (p < .01) than that for the non-psychia- 
tric out-patient group and for the greater St. Louis area group. The results of these 
comparisons are presented in Table 1. 
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TABLE 1. COMPARISON OF CONTROL AND PsycHIATRIC OUT-PATIENT GROUPS 
witH Respect TO DMF Trers Ratincs 


Mean 
N Mean Age DMFT 


Control Group I Sl 35.6 13.53 
(medical out-patients) (adjusted) (adjusted) 


Psychiatric Out-patients 81 35.6 18.38 


Control Group II 1580 39.4 15.19 


*Significant at the .01 level 


The difference in the mean DMF Teeth rating between the chronic and acute 
psychiatric sub-group as determined by a ¢ test was not significant. In the breakdown 
into diagnostic groups, a comparison was made only between the sub-groups of 
anxiety reaction, hysteria and depressive reaction as the number of patients falling 
into the other various diagnostic groups was too small to allow for a meaningful 
comparison. An analysis of covariance was used in the diagnostic group comparisons 
to control for significant age difference between groups. The resulting DMF Teeth 
rating differences between the diagnostic groups were not significant. The results of 
these analyses are presented in Tables 2 and 3. 

TABLE 2. CoMPARISON OF DMF Treru Ratinas ror AN ACUTE AND CHRONIC 
PsYCHIATRIC OUT-PATIENT GROUP 


Group N Mean Age 


t ratio 


Mean DMFT _ tratio 


Acute 19 34.6 


Chronic 46 


TABLE 3. Comparison oF DMF Teeru RatineGs For PsycHIaTRIC OUT-PATIENT 
Dr1aGcnostic Groups 


Mean Age Mean DMFT Covariance F 
Anxiety Reaction 34.7 
Hysteria 19 36.3 5.6: 2.12 
Depressive Reaction 8 49.3 





CONCLUSIONS 


The findings of the present study that people with emotional problems have a 
higher incidence of dental caries than a group with medical problems and a group 
from the general population lend support to the application of the psychosomatic 
concept to dentistry. The lack of differentiation of the incidence of dental caries be- 
tween the psychiatric sub-groups in this study would seem to be related to the usual 
difficulties encountered when trying to employ a psychiatric classificatory system. 

While the positive findings of this study are consistent with the psychosomatic 
concept, further research is necessary to ascertain the specific relationships between 
psychological factors and dental caries. 
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ROLE-TAKING ABILITY AND SCHIZOPHRENIA 
M. DAVID DIAMOND 
Riverside Hospital, New York, N. Y. 


PROBLEM 

The importance of role-taking ability for normal social adjustment, and the 
significance of deficiency in the ability in various mental! disorders, recently has been 
widely emphasized. A disturbance in role-taking ability has been especially noted in 
schizophrenic disorders®’, Cameron“? believes that schizophrenic disorders de- 
velop in persons who have failed to acquire adequate ‘‘role-taking”’ skills. Sarbin “ 
found that the variability in the number of roles taken is extremely limited for schi- 
zophrenic patients. Shor“: »- 4) observed that the ‘‘schizophrenic . . . usually re- 
fused to accept any externally suggested fantasy situation” in psychodrama sessions 
at a neuropsychiatric hospital. 

For the purposes of this study, empathy or role playing may be considered as a 
“process of relatively brief identification whereby, in a conscious or unconscious 
fantasy, one merges himself with another to understand and share the other s feelings 
and attitudes,” 4: P. 47) 


PROCEDURE 

A multiple choice modification of the Rosenzweig Picture-Frustration Test was 
used to measure ability to predict the behavior of specified individuals in symbolic 
interpersonal situations. This technique was selected because the cartoon situations 
are interesting, required reading level is low, and representative responses are avail- 
able. The modification of the Rosenzweig Picture-Frustration Test which was ad- 
ministered supplied three multiple choice responses for each of the 24 situations. The 
modification was constructed for this study. The choices given for each picture were 
chosen from examples in the scoring manual. One of the choices for each picture is 
clearly extrapunitive, one impunitive and one intropunitive. The order of the three 
choices varied from picture to picture. 

Subjects were given brief descriptions of three people. The descriptions were 
constructed by the investigator so as to be of extrapunitive, impunitive and intro- 
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punitive individuals respectively. To verify the relevancy of these descriptions, 
three clinical psychologists were asked to respond to the test as the described in- 
dividuals would be expected to respond. They gave predominantly extrapunitive, 
impunitive and intropunitive responses, respectively, for the three individuals. 

For pictures 1-8 the subjects were asked to respond to the multiple choice situa- 
tion as the first person described (the extrapunitive one) would respond; for pictures 
9-16 as the second person described (the impunitive one) would respond; and for 
pictures 17-24 as the third person described (the intropunitive one) would respond. 
Directions to subjects were: 

Bob R., Ralph G. and John L. all applied by mail for jobs in a business firm. Their le ‘tters 
of application and their job backgrounds seemed very similar so each man received a letter asking 
him to come for a personal interview with : a Mr. Brown at a certain time and place. Following are 
descriptions of their interviews. Read each one carefully and 5g? to get an idea of what these men 


are like. Pictures 1-8 (or 9-16 or 17-24) are the pictures Mr. Prown showed Bob (or Ralph or 
John). I would like you to circle the answers the way you think he did. 


The number of responses agreeing with the category into which the description 
fell was obtained for each subject. The degree to which the subject’s responses 
agreed with those of the category selected by the judges provided a measure of the 
ability to predict the behavior of specified individuals in symbolic interpersonal 
situations. 


Subjects. The performance of 30 male schizophrenic patients was compared with 
that of a group of 30 male non-schizophrenic drug addicts. The schizophrenic 
patients were selected from the psychiatric male population at Bellevue Hospital. 
Only those patients were selected who had been diagnosed as schizophrenic by the 
psychiatric staff. Deteriorated or ‘‘latent’’ schizophrenics and subjects undergoing 
electro-shock therapy were excluded from the sample. The criteria listed in the 
American Psychiatrie Association Diagnostic and Statistical Manual: Mental Dis- 
orders? are used at Bellevue Hospital for hospital diagnosis. Ten were classified as 
schizophrenic reaction, paranoid type; nine as schizophrenic reaction, childhood 
type; seven as sc hizophrenic reaction, simple type; and four as schizophrenic re- 
action, chronic undifferentiated type. 

A control group was sought which manifested a variety of types of psycho- 
pathology to eliminate the possibility that the hypothesized deficiencies in inter- 
personal skills were concomitant with emotional disturbance per se. Drug addicts in 
a rehabilitation center were selected as control subjects, since, as a group, they pre- 
sent a variety of emotional disturbance. The drug addict control group was selected 
from the male patients at Riverside Hospital, a treatment center for adolescent drug 
users. No individual classified as schizophrenic was selected for the control drug 
addict group. Of the thirty drug addicts, eight were classified as inadequate person- 
ality; eight as schizoid personality; six as passive-aggressive personality, and one was 
classified in each of the following eight categories: adjustment reaction of adoles- 
cence, adjustment reaction of childhood, mixed psychoneurosis, neurotic character 
disorder, personality pattern disturbance, personality trait disturbance, psycho- 
neurotic reaction and sociopathic personality. 

The control subjects were matched with the schizophrenics with respect to age, 
highest school grade completed, intelligence, race and socioeconomic background. 
The mean ages of the schizophrenic and drug addict groups were 19.3 and 18.5 

respectively; the mean highest school grade completed, 9.6 and 9.9 respectively; the 
mean IQ’s 94.1 and 98.8 respectively. The two groups did not differ significantly 
with respect to any of these three characteristics. Both groups consisted of about 50 
per cent Caucasion, and 25 per cent each of Puerto Rican and Negro individuals, 
primarily of lower socio-economic background. 
1Appendices containing the multiple choice items and personality descriptions may be obtained 
from the author. 
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RESULTS 

The number of responses agreeing with the category into which the described 
person fitted was obtained for each subject (Table 1). The difference between the 
schizophrenics’ mean of 12.3 and the drug addicts’ mean of 14.8 was significant at the 
.01 level, thus supporting the hypothesis that schizophrenic individuals predict the 
behavior of other individuals in symbolic interpersonal situations less accurately 
than do non-schizophrenic drug addicts, 

The number of correct responses given for each of the three descriptions is also 
presented in Table 1. The mean number correct for the extrapunitive description 
was 4.6 for schizophrenics and 6.0 for drug addicts, with the difference between 


TABLE 1. PREDICTION OF BEHAVIOR OF SPECIFIED INDIVIDUALS BY SUBJECTS 
(30 SCHIZOPHRENICS AND 30 Drua Appicts) 


Number of “Correct’’ Responses 


Type of Schizophrenics Drug Addicts 
Individual Mean 8.D. Mean 8.D. 
Described 


Extrapunitive ) 2.§ 6.0 


Impunitive : 4. 
Intropunitive 3.$ 6 4 


Total 
Sum of Means 12.3 3.¢ 14.8 
*Difference significant at .05 level 
**Difference significant at .01 level 
means statistically significant at the .05 level. The mean number correct for the im- 
punitive situation was 3.8 for schizophrenics and 4.1 for drug addicts, with the 
difference between means not statistically significant. —The mean number correct for 
the intropunitive situation was 3.9 for schizophrenics and 4.7 for drug addicts, with 
the difference between means statistically significant at the .05 level. Thus the drug 
addicts predict the behavior of described extrapunitive and intropunitive individuals 
better than do the schizophrenics. The insignificant difference between groups for 
the impunitive response may be due to the fact that an impunitive mode of response 
represents an avoidance of the frustrating situation, while the extrapunitive and 
intropunitive modes of respense are more direct ways of reacting to situations. Be- 
cause they reflect an avoidance of the frustrating situation, the cues given in the 
description of the impunitive person were more subtle than those given in the des- 
criptions of the extrapunitive and intropunitive persons. 


DiscussIoN 

An extension of Mead’s® viewpoint furnished one framework within which 
the results might be integrated. In Mead’s theoretical system, the normal ‘‘self”’ is 
developed largely through the persons’ taking the role of others, thus forming ex- 
ternally validated definitions of the self and the world about one. Since the schizo- 
phrenic is deficient in role taking, as has been experimentally demonstrated, Mead’s 
theory would hold that the development of the “‘self’’ in the schizophrenic is ab- 
normal. In a sense, classical Freudian theory provides a similar interpretation in 
that schizophrenics are regarded as regressed to a period where the ego is weak or 
non-existent. As in Mead’s theory concerning development of the “‘self”’, the ego and 
super-ego are viewed as developing largely through a process similar to that of 
“taking the role”’ of others, 7.¢., the process of identification. 


SUMMARY 
The present study tested the hypothesis that schizophrenic individuals predict 
the behavior of specified individuals in symbolic interpersonal situations less accur- 
ately than do non-schizophrenic drug addicts. The performance of thirty hospital- 
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ized male schizophrenics was compared with the performance of thirty hospitalized 
male drug addicts, who were similar to the schizophrenics with respect to age, highest 
school grade completed, intelligence, race and socioeconomic background, but who 
had never been diagnosed as schizophrenic. 

Using a modification of the Rosenzweig Picture-Frustration Test in which sub- 
jects responded to a multiple-choice form of the test as three described individuals 
would respond, schizophrenic subjects predicted responses of the described individ- 
uals significantly less well than did non-schizophrenic drug addicts. In general, the 
results confirmed the hypothesis that, as compared to non-schizophrenic emotionally 
disturbed individuals, schizophrenics are deficient in the ability to predict the res- 
ponses of other persons in interpersonal situations. 
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STEREOTYPES OF NATIONALITIES BY STUDENT NURSES! 
THOMAS F. GRAHAM 
Massillon (Ohio) State Hospital 


PROBLEM 


This study investigates the influence nationality may have upon the selection of 
a registered nurse for an hypothetical assignment. The objectives of the present 
research are as follows: (a) To design a relatively innocuous method which elicits 
stereotype reactions in a given area. (b) To clarify some of the psychological factors 
involved in the preference of one nationality over others. 


METHOD 


Ten classes totaling 367 student nurses, met once weekly during the last month 
of their affiliation in psychiatry at Massillon State Hospital. The number of sub- 
jects in each group ranged from 30 to 46, mean of 36.7. These meetings were held to 
discuss personal, social and professional adjustments. No academic credit was given 
for attendance. 

The students represented seven nursing schools in Ohio, i.e., St. Thomas, Akron; 
Aultman, Canton; Massillon City, Massillon; Mercy, Canton; Salem City, Salem; 
Bethesda and Good Samaritan both of Zanesville. Five groups were from Alabama; 
Anniston Memorial, Anniston; Birmingham Baptist and Carroway Methodist, 
Birmingham; Druid City, Tuscaloosa and Sylacauga. Indiana included two; Mem- 
orial of South Bend and Protestant Deaconess of Evansville. Directions given to 
each class were as follows: 

“This is an hypothetical problem. You are to select a registered nurse to care for your convales- 
cent fiance from the following national groups which are listed in alphabetical order: (a) English, 
(b) French, (c) German, and (d) Italian. All the nurses are well trained, speak good English, and 
are equally attractive. Tell in a few words why you selected one and not the others.”’ 


‘With appreciation for the assistance of James W. Zingery, Robert B. Balch and Mary-Jean 
Kanagy. 
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RESULTS 

The first major objective was to elicit stereotype responses in a given area. Re- 
sults of national preferences expressed by the groups of student nurses are sum- 
marized in Table 1. Chi square values were determined for four categories. An inter- 
TaBLE 1. Cuorce or NATIONALITY ExpressED BY TEN Groups OF STUDENT NURSES IN TERMS OF 
Cur Squa 








Groups English French German 
X? l.o.¢. x? l.o.¢. x? l.0.¢. 
A | 24.3 .001 
B 5.3 Ol 
C 6 .001 
D 5 001 
E + .30 
F } 2.8: .50 
G | 4. .20 
H } 19. .001 
I | 11.6 01 
J 19.20 .001 
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group analysis revealed a range of X* = 46.15 (Group D) beyond the .001 level of con- 
fidence to X? = 2.83 (Group F) beyond the .50 level of confidence, selecting the 
English. Conversely, significant relationships were found in not choosing the French 
with X? = 11.50 (Group G) beyond the .01 level of confidence to X* = 3.34 (Group 
E) beyond the .50 level of confidence. 

In preference for the German nurse the range was X? = 9.59 (Group G) beyond 
the .05 level of confidence to X? = .30 (Group A) beyond the .98 level of confidence. 
A significant deviation was also found for the Italian in which X? = 7.12 (Group I) 
beyond the .10 level of confidence to X*? = .16 (Group F) beyond the .99 level of 
confidence. 

Excluding the French, Group F was nearest the expectancy responses. How- 
ever, all groups of subjects showed marked stereotype reactions. Of the 367 students, 
208 (56.7%) selected the English nurse, 10 (2.7%) chose the French, 104 (28.3%) 
picked the German, and 45 (12.3%) the Italian. 

Next, an attempt was made to clarify some of the psychological factors involved 
concerning the choices. As stated in the directions, an inquiry followed to reveal the 
“rationale” of the subjects’ responses. Table 2 lists the stereotypes associated with 


TABLE 2. REASONS GIVEN BY THE STUDENTS FOR THEIR CHOICES 





Reasons 


Nationality Selected Not Selected 
Group Trait Percent Trait —_—_—=é&Perrcentt 


English Confident 071 Aloof .136 
Standards .112 Lack humor 079 
Maturity .038 Phlegmatic O11 
Descent .346 Proud .207 
French Affective O11 Amorous .251 
Allies .005 Emotional 142 
Generous .003 Flighty .237 
Sociable .008 Sexy 34° 
German Dependable .090 Dominant 
Efficient .032 Severe 
Industrious .098 Strict 
Quiet .063 War-like 
Italian Considerate .030 Aggressive 
Friendly .049 Careless 
Mother figure 005 High strung 
Religious .038 Unstable 
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each nationality. In this regard the percentage of student nurses applying more or 
less dichotomous traits to their choices are also given. 

It is of interest that one of the ten girls who selected the French nurse gave as her 
reason, “I thought of the Grey nuns, an order of outstanding nurses’’ (generous). 
Another was reminded of the heroine of Dien Bien Fu (allies). One said her choice 
was determined by the fact that her own mother was French (affective). However, 
at the other extreme, three students wanted to test their future husbands to deter- 
mine if they would remain loyal. Therefore they ‘‘reasoned’’ the men would be faith- 
ful after marriage if they resisted the stereotyped charms (affective) of the French 
nourrice. 


CoNcCLUSIONS 

A simple method was designed to elicit preference responses in a given area. An 
attempt was also made to define some of the factors contributing to the acceptance 
or rejection of one nationality for a nursing assignment as compared to others. 
Results of the investigation suggested the following general conclusions. The data 
elicited may be quantified and analyzed objectively (Table 1) and semi-objectively 
(Table 2). Moreover, the fact that stereotype reactions could be recognized and 
measured suggests the possibility that the method employed may be an additional 
device for the investigation of national relations. From the viewpoint of applica- 
tion, the technique may also serve as a stimulus, and increase rapport for subsequent 
group discussions designed to foster a better understanding among individuals. 


THE MATURATIONAL FACTOR IN HUMOR! 
LEO R. GRAHAM 


2680 Saturn Ave., Huntington Park, California 


This investigation attempts to dimensionalize the personality trait commonly 
referred to as the “sense of humor” and was specifically concerned with the investi- 
gation of the perception of humor among children of different elementary school 
ages. In order to circumscribe the area explored in this study, humor was defined as 
any object or situation, or any part of an object or situation, which would elicit, 
directly or indirectly, the language response, ‘‘This is funny,’’ or some equivalent 
expression. From this definition of humor it may be inferred that this study was not 
concerned with the nature of humor but with the differential responses of children 
to a potentially humorous stimulus. Our hypothesis is that a relationship exists be- 
tween the perception of humor and the level of growth and development of the 
personality. 


MetTHOD 

It was decided that the problem of selecting the ‘funniest”’ one of four cartoons 
would validly differentiate responses of children. After consultation with teachers 
who had considerable experience with children in the primary grades and who agreed 
in this judgment, it was arbitrarily decided that a modified forced-choice situation 
would be used. The children would not be required to rank the cartoons nor would 
they be forced to make a choice if they decided that none of the cartoons were 
funny. However, if they thought that two cartoons of a group were funny, they 
would have to chose one. 





1The author wishes to acknowledge the assistance of M. Bruce Fisher, J. A. Buehler, and R. J. 
Howell in the preparation and execution of this study. 
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Approximately 200 cartoons, presumed to be funny, were gathered from maga- 
zines or newspapers and pasted on notebook paper in sets of four on each page. After 
administering the cartoons, in the manner described below, to the writer’s children 
(a daughter, aged 9, and a son, aged 6,) and two adults, it was decided to reduce the 
number of cartoons. The children and adults complained that after looking at 100 
‘artoons they lost interest in the task. The number was arbitrarily reduced to 84 
cartoons in 22 sets to minimize this tatigue by reducing the work period. 

A second reduction was decided upon after 10 children from each of the three 
grades used in the sample were tested. The children still seemed to lose interest be- 
fore the 22 sets were viewed. It was considered more desirable to reduce the length 
of the task than to resort to artificial means to stimulate interest. The cartoons 
eliminated were those which were chosen by all three grades as the modal selection. 
In these cases the entire set of four cartoons was eliminated. The cartoons were re- 
duced to 44 on 11 pages. 


Subjects. The sample for this study consisted of 60 pupils selected from the Ist, 3rd, 
and 5th grades of the College Elementary Demonstration School, Fresno State 
College, 20 pupils from each grade. Although a systematic randomization was not 
used, an effort was made to choose pupils in an unbiased manner. Jf there were any 
unconscious tendencies towards special selection, they probably operated in selecting 
all the pupils. The sample included about 80% of the enrollment in each grade. 

The children were white, American-born, and monolingual from urban, upper, 
or upper-middle class homes. No effort was made to control the selection as to 
nationality of descent, religious affiliation, or educational achievement other than 
that indicated by grade placement. 


Procedure. The examiner and the subjects sat side by side during the administration 
so the subjects could feel more at ease and could point to various items which they 
considered humorous. A nondirective attitude was maintained by the examiner in 
the belief that too much enthusiasm might influence the children to pick cartoons 
which they thought the examiner might consider humorous. Each set of four car- 
toons was shown to each subject with the same question until the responses became 
spontaneous. If the subject chose two cartoons as equally humorous, the examiner 
insisted on a choice of one. 

The chi square technique was used to test the significance of the differences be- 
tween the three groups in selecting cartoons as most humorous. Each item, 2.e., 
each set of four cartoons, was compared separately and the test as a whole was 
treated similarly. In each comparison the grades were grouped to give a four-fold 
contingency table. The Ist grade pupils were compared with 3rd and 5th grade 
pupils and the Ist grade modal choices were compared with the summation of all 
other choices. This scheme was used for each grade. 


RESULTS 
The chi square for the Ist grade was 43.0, for the third grade 15.8, and for grade 
five 12.3. The P values with one degree of freedom indicate that the intergrade 
differences among the children in the perception of humor, as evaluated by their 
choices of cartoons, are significant beyond the .001 level of confidence. The hy- 
pothesis, therefore, may be accepted. 


DIscussIoN 

These findings may suggest that the additional experience and/or maturation 
which takes place between the different grades increases the number of objects or 
situations which will be perceived as humorous. The fact that only one set of the 
cartoons was significantly useful in separating the 5th grade pupils from the 1st and 
3rd grade pupils tends to substantiate this impression. 

While this study was concerned with cross-sectional data which do not permit 
analysis of individual trends, longitudinal data might make it possible to discover a 
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personality differentiation among individuals which would account for the wider 
dispersion found in the 3rd and 5th grades. It has been suggested by the study of 
Cattell and Luborsky“? that adults do tend to favor different kinds of jokes on such 
a basis. 

The larger chi square for the Ist grade also suggests that these younger pupils 
tended to select cartoons which were markedly different from those selected by the 
other two grades. Apparently there is not only greater homogeneity in the percep- 
tion of humor among the Ist grade pupils, but there is also a shift in the types of 
thing perceived as funny with increased maturity and/or experience. While the 
nature of this shift was beyond the scope of this investigation, it should receive 
further study. 

As part of the investigation the subjects were asked to explain why they chose 
the different cartoons, but their answers revealed the perplexity and lack of insight 
commonly associated with an attempt to explain why anything is funny. It seems 
to be a peculiarity of humor that when we try to explain it, it disappears. This in- 
ability to explain why the cartoon was funny was found to be evenly distributed over 
the three grades. 


SUMMARY 


This study tested the hypothesis that a relationship exists between the per- 
ception of humor and the level of growth and development of the personality. Sixty 
pupils from the Fresno State College Elementary Demonstration School were ex- 
amined using 44 cartoons arranged in 11 sets of four. Each subject indicated which 
cartoon in each set he considered to be the funniest. The computation of chi squares 
from four-fold contingency tables reveals significant differences among the choices 
of the children in the three grades. Therefore, the hypothesis may be accepted. The 
study indicates a direction which may be taken in the preparation of a test of the 
perception of humor by children in the elementary school grades. 
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RECOVERY FROM PARANOID DELUSIONS FOLLOWING 
HYPNOTIC UNCOVERING OF REPRESSED EPISODES 


JAMES C, STAUFFACHER 


Veterans Administration Hospital, American Lake, Washington 


INTRODUCTION 
This case is presented because of the light thrown upon the dynamics of paranoid 
delusions and its implications for the therapeutic management of such cases. Ap- 
parently, this patient achieved a complete social recovery although the basic homo- 
sexual conflict was not brought to the awareness of the patient. Further, the thera- 
pist did not deal directly with the transference or with infantile memories. 


Case History 

The patient, ““Mr. Q’’, was referred for group therapy since he had not responded 
to the usual forms of somatic therapy, including electro-convulsive and insulin con- 
vulsive treatment. Some of the details concerning the patient’s experience and his 

background have been altered in order to maintain anonymity. 
Mr. Q was 28 years of age, married (apparently happily, until the outbreak of his psychosis), 
had an engineering background and before his entry into the service had an industrial position of 
considerable responsibility. In fact, his position was so incredible for one of his years that his 
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account of his pre-service occupational activities was felt to be based on psychotic delusions until 
their veracity was verified by his former employer and his wife. 

Mr. Q was a slight but intense individual who was quite aloof, but who manifested his high 
intelligence on those rare occasions when he could be drawn into group discussions. When this 
happened, the topic of conversation was always an impersonal one carried on in terms of general- 
ities with little or no real emotional involvement on the part of the group members. Most of the 
time Mr. Q sat without participating and giving the impression that, as far as he was concerned, 
the activities of the group were meaningless, although they were all well and good for the others 
since they were “‘crazy’’ and needed therapy. Obviously, he had no insight concerning the pres- 
ence of any condition that might require hospitalization or treatment. In his opinion, he had been 
incarcerated at the instigation of his wife (this was correct) who wanted him out of the way in 
order that she might continue her many sexual escapades (these apparently occurred only in the 
patient’s mind). 


The Opening Wedge. This situation existed for several weeks with no indication of any 
change until the following episode occurred. One member of the group was a young veteran, who 
had been a naval aviator during his service experience, well on the road to recovery and talking 
entirely rationally. He had a fine sense of the dramatic and had the ear of the other members of 
the group, particularly when he recounted tales involving his service experiences. This is the story 
he told one day as it had been told to him by one of his Navy buddies: A young friend of his had 
been making regular runs out of one of our western ports. He was in the habit of calling his wife 
upon his arrival so she could pick him up in the family car. He arrived around 2:00 one summer 
morning and, upon seeing an available taxi, decided that he wouldn’t bother his wife. Upon 
reaching the apartment building he went around to the rear and went up to the back door of the 
apartment. It was a warm summer night and he found the back door ajar, but the screen door was 
secured by a short chain lock. There was enough light so that he could see through the kitchen 
into the bedroom. To his consternation he saw his wife lying nude, in the arms of another man. 
The young officer was newly married and very much in love with his young bride. He had had no 
previous indication of the possibility of infidelity on her part. Naturally, he was very much dis- 
turbed and began to shout angrily and to bang on the door. The other man was aroused and 
asked what he wanted. The young officer discovered that in his haste he had stopped at the 
apartment just under his, a mistake that was easy since the physical layout was identical on each 
floor. This story was wel! told and created quite an impression on the group. During the ensuing 
discussion the narrator (who had considerable insight) suggested that many of the mistaken 
ideas that emotionally disturbed people develop might have resulted from similar circumstances. 

The group broke up for the day shortly and Mr. Q came up and walked with the therapist to 
the building where his office was located. This was unusual since he ordinarily preferred his own 
company. He walked along for some time without saying anything. Finally he said, ‘““Do you 
suppose it could be possible that I have been mistaken about my wife’s infidelity? She never 
gave me any reason to suspect her before I went overseas and I did arrive home in a rather upset 
condition.”” The therapist replied something to the effect that this was a possibility meriting 
further consideration and invited Mr. Q to discuss the matter further in his office. In spite of the 
fact that he had a therapeutic appointment with another patient for that time, he took Mr. Q to 
his office immediately because he felt that he might never get another opportunity if he allowed 
this one to pass. 

The patient found it extremely difficult to begin talking about his problem. The therapist 
was probably over-cautious and sat quietly for quite a time before saying anything. He then 
asked Mr. Q why he felt that there was a possibility that he might be mistaken about his wife. 
The patient quickly retorted that there was no question about his wife’s indiscretions. He seemed 
about to burst into a long angry harangue, but he pursed his lips and sat rigidly for a time. Then 
he looked up with a wry smile and said, “I guess that’s what all patients think. At least the 
judge thought I was wrong or I wouldn’t have been committed. But, how could I be so sure and 
still be wrong?”’ The therapist replied that he didn’t know if Mr. Q was wrong about his wife, 
that he didn’t know her or anything about her actions. He pointed out that intelligent people had 
often been wrong about things that they felt sure about and went into an explanation of defense 
mechanisms in general and of projection particularly. The patient listened to this without saying 
anything, but was obviously skeptical. After all, he had seen what he had seen and he couldn’t 
have been mistaken. The therapist stated that projection was an unconscious mechanism and 
served to relieve the individual of feelings of guilt. Of course, the patient was unable to accept 
this and pointed out (not without considerable heat) that he had nothing to feel guilty about. 
However, he did accept the therapist’s invitation to talk things over further and a series of daily 
sessions was scheduled. 


The Shell Cracks. During the first few weeks nothing much happened. Mr. Q was intelligent 
and could understand explanations in general, but could not accept their personal application. 
He had difficulty in free association, reported no dreams and was becoming discouraged. De- 
tailed reporting of these sessions would add little to the reader’s understanding of this case. 

One day the patient asked whether hypnosis might not be useful in discovering some of the 
unconscious material that he wouldotherwise be unable to report. The therapist was not enthusi- 
astic. He had attempted unsuccessSully to use hypnosis previously with paranoids, usually failing 
in hypnotizing them, and thus lowering his status in the patient’s eyes. Further, many paranoids 
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are said to have a homosexual conflict at the core of their delusions and hypnosis induced by a 
male therapist might be panic-producing and eliminate all possibility of working with them. 

On the other hand, no significant progress had been made since the first session. The patient 
was becoming discouraged and seemed to feel that hypnosis might be of some help in his attempt 
to find some answers. So, the therapist allowed himself to be persuaded. He found that, in spite 
of Mr. Q’s earnest desire to try hypnosis, he was a poor subject and only a very light trance was 
induced after several sessions. Through the cooperation of the ward physician, sodium amytal 
was used to condition deepened trance and during subsequent sessions trances of adequate depth 
were obtained. 


The Story of Mr. Q. The early life of this patient was never investigated thoroughly. In the 
pre-hypnosis sessions the patient told of his work prior to service, his service experiences and 
what happened to him after he returned home. The account being re ported here contains material 
obtained both during the conscious and trance states. Where important, the state of the patient 
when the material was obtained will be stated. 

Before the war Mr. Q had achieved considerable status in his firm, a large manufacturing 
concern, with several plants in various parts of the country, and became one of its officers. During 
the early months of our participation in World War II, Mr. Q had been deferred because his con- 
tribution to the war effort through the technical training he gave service people was deemed to be 
essential. However, Mr. Q traveled to Africa, worked with combat experienced individuals and 
grew increasingly dissatisfied with his own contribution and his civilian status. 

Finally this became unbearable to him and he enlisted, hoping to become a combat pilot. 
Because of a visual defect he was rejected for pilot training. Eventually he became a maintenance 
man and tail-gunner. He was stationed in England and flew many missions over Germany with- 
out mishap. On the way back to the base the plane was struck with flak and the crew was forced 
to abandon it over the Channel. They were all picked up and assigned another plane. This 
bomber had been through heavy service and had been laid up for repairs for some time before it 
was assigned to Mr. Q’s crew. Mr. Q went over the ship meticulously until he felt sure it was ready 
for its next mission. The ship crashed on its first flight, from a very low altitude. The pilot, co- 
pilot and Mr. Q were the only members of the crew that survived. The patient was asked whether 
he felt he was in any way to blame for the crash or whether others felt him guilty to any extent. 
He denied both rationally and without apparent disturbance. He explained that he was thorough- 
ly familiar with that type of plane and that he had given it particular attention because of its 
obvious signs of damage and repair. He said also that his pilot had told him that the ship must 
have been sabotaged and gave no indication that he felt that Mr. Q had been negligent. 

This material was volunteered by the patient during the early conscious interviews. Mr. Q 
was again questioned about his feelings regarding the crash while he was in a light trance state. 
He confirmed the above account. When a deeper hypnotic state was achieved, the patient was 
regressed back to the time when he was in the hospital immediately after the cr: ash. He was — 
that when he regained consciousness he would hear his pilot and co-pilot talking and would repea 
their remarks to the therapist. After a time the patient became quite agitated and reported the 
they felt he had goofed and was responsible for the deaths of the other members of the crew. As a 
result, he felt first angry and then began to have feelings of self-doubt and remorse. He began to 
wonder if he had forgotten to make certain crucial checks and to question whether he wasn’t 
actually to blame for the crash. After these feelings had been explored in detail, amnesia was sug- 
gested and the trance was terminated. 

Let us now return to Mr. Q’s story. He had not been seriously injured and was soon back on 
duty. He seemed to take his next few missions as he always had. However, he gradually became 
jittery, was troubled with insomnia, was short-tempered and began to drink increasingly. He had 
only a few missions to make before he was eligible for rotation to the states and he was able to 
make them, although not without considerable difficulty. Mr. Q was sent to a stateside assign- 
ment for a short time and then given an honorable discharge on points (duration of service). 

Hypnosis enabled the therapist to obtain additional material concerning this period that was 
not available during the conscious state. During his brief assignment in the states, Mr. Q became 
acquainted with a beautiful young girl and spent a considerable amount of time in her company. 
This was out of character for him. He had never had a date during his entire service career until 
this time. He celebrated his release from the service with this girl and both of them were in- 
ebriated. This binge evidently lasted about ten days and apparently was accompanied by sexual 
intimacy. At any rate, during the trance Mr. Q reported that he had no memories until awakening 
one morning in bed with the girl, both nude. The apartment was in a disheveled state and Mr. 
Q found a letter from which he learned that the girl was also married and that her husband was in 
an Army hospital recovering from injuries received in combat. The patient felt very guilty about 
his indiscretions and his remorse was not lessened when he discovered that his entire severance 
pay was gone. He did not know how much of it he had given to the girl and didn’t wait to find 
out. He dressed as hurriedly as he could and left. It was necessary for him to wire collect to his 
wife for funds with which to return home. 

Apparently, this whole episode was repressed, since no information concerning it was ob- 
tained except during deep trance states. It was felt that the patient would be unable to handle 
this material immediately and amnesia for it was suggested before he was awakened. 
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Therapeutic Management. The patient was seen every day for a period during which the first 
half of the session was spent in trance and the rest in the conscious state. The material lying be- 
hind his delusions was worked through carefully with him while in the trance. Understanding of 
the mechanisms involved was easily obtained. It was repeatedly sugge sted to him that he would 
gradually remember the repressed material as he was able to handle it. This was accomplished 
over a period of about one month. 


The amount of trance used was gradually reduced and none at all was used during the last 
week and a half. All sessions were carried on vis-a-vis and everything possible was done through 
discussion and the hypnotic procedures used to allow the patient to feel that he was in control of 
the situation. The patient’s remission was apparently complete. After release from the hospital 
he returned to his pre-service position and the reply to a follow-up letter three years later from his 
wife indicated that Mr. Q was functioning at his previous high level of effectiveness and that the 
marital relationship was excellent. 

DIscUSSION 

This was a very intelligent and highly moral man who set very high goals and 
was able to accomplish much due to his ability and capacity for sustained work. In 
stature only was he a small man. He had seldom experienced failure, never in an 
area that was important to him. Suddenly he was confronted with a situation that 
caused him to question whether he was responsible for the death of several buddies. 

Although this was a severe trauma and caused some temporary neurotic symp- 
toms, he was still able to function and, apparently, got them well enough in hand 
without formal] therapeutic assistance so that he was given a non-medical discharge, 
based entirely on length of service. The episode with the girl was too much for him. 
Here he was proven to be lacking in the common decencies. The vicious nature of 
his attacks upon his wife upon his return home is a direct indication of the intensity 
of his rage over his own despicable actions. 

The writer often has felt that therapists frequently impose their theoretical 
framework upon patients through their interpretations. Acceptance by the patient 
indicates more about the transference relationship than it does about the ‘“‘truth”’ or 
even reasonableness of the interpretations. It is possible that the stage had been set 
for the patient’s account of the episode occurring in the hospital immediately after 
the crash, by the therapist’s interest in this period in the early sessions before hyp- 
nosis was used. Howev er, the material presented was not suggested by the therapist, 
whose only contribution was putting the patient in the spot at the time when the in- 
cidents could have occurred and to ask him what was going on. The account of the 
episode with the girl after his release from the service came as a complete surprise to 
the therapist and could not have been influenced by him. 

This patient’s pre-psychotic personality structure was not altered by the thera- 
peutic experience and the dynamics and early personality development were not 
thoroughly investigated. The therapist felt that such an attempt would have in- 
vited another overt psychotic break. It seemed reasonable to assume that this 
patient had made a satisfactory adjustment prior to his service experience and that 
therapy would be successful if his defenses could be reinstated so that he could 
function at a level approximating his pre-psychotic performance. This apparently 
was accomplished. 


SUMMARY 


The therapeutic history of a young male paranoid schizophrenic has been re- 
counted. This patient had made a satisfactory pre-service adjustment and ap- 
parently Qesdiaind a paranoid schizophrenic break as a result of intense guilt asso- 
ciated with an airplane crash which killed many of his buddies and for which he felt 
responsible. The psychosis was finally precipitated by intense guilt feelings over a 
promiscuous episode. He apparently made no immediate therapeutic gains with 
electro-convulsive and insulin coma therapy. Readiness for individual psychotherapy 
was indicated through his reaction to a story told by another patient during a group 
therapy session. Intensive attempts at uncovering psychotherapy were unsuccessful. 
At the patient’s request, hypnosis was used and apparently was successful not only in 
uncovering the dynamics involved in the development of the psychotic symptoma- 
tology but also in the therapeutic manipulation of the patient. 
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THE UNITARY CONCEPT OF MENTAL ILLNESS vs. UNITY OF 
PERSONALITY INTEGRATION 


Karl Menninger“? recently has expressed his bold opinion that mental illness 
is ‘essentially the same in quality, and differing, rather, quantitatively.”’ In reaching 
this viewpoint, Menninger cites the historic evolution of the unitary concept of 
mental illness culminating in the dictum of Heinrich Neumann that ‘“There is only 
one kind of mental illness.”’ This organismic view regards all mental illness as re- 
flecting various degrees of disruption of homeostasis, with various stages of break- 
down in self-regulation being reflected along a range of severity of symptoms from 
nervousness to chronic psychosis. Unfortunately, Menninger does not cite objective 
research evidence supporting this view, but is content to quote eminent authorities 
who have supported the concept. 

It is easy to understand the reasoning leading to this view. We have reached a 
point in psychiatric understanding where we recognize (a) that no sharp delineation 
occurs between the neuroses and the psychoses, (b) that mixed symptom pictures 
are the rule rather than the exception, and (c) that the classical Kraepelinean classi- 
fication system breaks down to the point where, at least with younger patients, 
practically all psychotic reactions are labelled as schizophrenic with all lesser re- 
actions being labelled as psychoneuroses (usually anxiety reactions). Trousseau’s 
principle of specificity had irrefutable application to physical disorders but seems to 
break down when applied to psychiatric conditions. The denouement occurred 
within the last 10 years as an increasing weight of research evidence cast great ques- 
tions on the validity of classic psychiatric diagnostic criteria and classification sys- 
tems. Increasing clinical experience brought cleancut realization that the so-called 
diagnostic entities were not discrete and specific typologies but instead were the re- 
sultants of processes occurring in various degrees and combinations. In this situa- 
tion, the concept that mental illness is unitary is a grossly inadequate and nihilistic 
solution to the taxonomic difficulty although it may have appeal in other respects. 

In a broad sense, all disturbances which interfere with the normal performance 
of an individual’s life functions have a common link. There is some unity among all 
illnesses both physical and mental. However, this need not imply that the disabling 
or impairing processes are homogeneous. The evidence seems to be rather that they 
are manifold, that the problem involves a multivariate psychophysiological organ- 
ism-environment interaction system. The present nosological entities represent 
interaction patterns of relatively high frequency, but obviously do not account for 
all or even a great number of patterns occurring. An improved nosology is urgently 
needed. In our opinion, however, the suggestion that the present nosological cate- 
gories should be combined in a broader one, because they overlap to some degree and 
fail to classify a large number of cases, is not a solution. It appears more an admission 
of failure. 

The taxonomic problem in mental illness is similar to that of personality, and 
the unitary concept of menial disorder is analogous to the unitary concept of person- 
ality integration. Both concepts reflect a resultant of complex interaction processes, 
but neither is explanatory of the component processes. To the diagnostician, the 
therapist, the counselor, the nature of these component processes is the essence and 
the taxonomic system must be explanatory of these. To be certain we are dealing 
with an whole organism which seeks to maintain unity or self-consistency, to main- 
tain homeostasis and self-regulation, both on physical and mental levels. The fact 
that homo sapiens is constituted as he is, and must struggle to maintain integration 
in a world of rather constant composition, means that human symptomatology and 
adaptive mechanisms occur phenomenonologically within certain deterministic 
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limitations, 7.e., within a limited number of etiologic patterns. The fact is that the 
so-called ‘“‘unitary”’ characteristic can be determined either by the efforts of the 
organism to maintain “unity” or by the ‘‘unitary”’ nature of disorder if, indeed, such 
actually exists. We are inclined to believe that the “unitary” quality is contributed 
more by the struggle of the organism to organize integration, and that the “unitary”’ 
quality of mental disorder is due, if anything, to the factor of simply being sick 
mentally. Unity is therefore considered as a phenomenonological resultant of more 
or less successful personality integration on whatever level the organism can organize 
itself. 

A similar logical distinction must be made in categorizing both physical and 
mental disorder. A preliminary diagnosis is always made in arriving at the de- 
cision that a person is ‘“‘sick’’ without regard to what the specific disease entity may 
be. Any form of illness involves some breakdown of homeostasis and normal organ- 
ismic regulation. All symptoms may be arranged on a continuum from the least 
serious malaise to the most serious terminal coma. This does not mean that we are 
dealing with an “unitary” disease having protean manifestations. A practical appli- 
cation of this point occurs in relation to the interpretation of factor analysis data 
such as Eysenck’s®?) where higher order factors of psychoticism and neuroticism 
have been interpreted as indicating what might be identified as a unitary mental 
disorder. Actually, such findings cannot be accepted definitively as contributing to 
the present problem. At this time, considering the nature of factor analytic design, 
such higher-order traits probably can be interpreted more parsimoniously as referring 
to the generalized continuum of degrees of sickness rather than to specific processes. 
The recent work of Cattell, in integrating a tremendous amount of factor-analytic 
research, gives promise of progress in the development of an empirical basis for a 
multivariate model for the specific processes. 

In the light of the whole history of clinical practice and our understanding of 
the multitude of factors etiologic in pathological conditions, it is extremely improb- 
able that all (or even most) clinical manifestations can be explained by one determin- 
istic principle. All illness, both mental and physical, appears to involve at least two 
factors: (a) a pathological process (pathosis), and (b) the struggle of the organism to 
neutralize its effects. We contend that Menninger’s formulation emphasizes the 
latter but may actually impede understanding of the former. 

Finally, it would be extremely unfortunate if any widespread acceptance of the 
unitary concept of mental illness were to inhibit continuing efforts to differentiate 
etiologic factors on the grounds that such efforts are useless because we are dealing 
with only one process anyway. Let us admit frankly that existing theories of psycho- 
pathology and psychiatric classification are of dubious validity, and that we have 
come to a clinical impasse until we can develop more valid formulations. We contend 
that it is only by the systematic appraisal of all levels of the organization of person- 
ality integration by some system: & *) that we can ever achieve an objective opera- 
tional understanding of all the classes of etiologic agents and organism-environment 
interactions which may determine health or illness. This means that since every 
organism is more or less unique, that we must devise a method of diagnostics capable 
of preserving individual uniqueness within the framework of psychopathological 
conditions known to occur. 

S. B. S. 
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Hab.ey, JoHN M. Clinical and Counseling Psychology. New York: Alfred A. Knopf‘ 

1958, pp. 682. $8.95 trade; $6.75 text. 

Dr. Hadley is professor of psychology at Purdue University, specializing in its 
clinical applications. He regards the functions or aims of clinical and counseling 
psychology as those of diagnostic evaluation, behavior modification (therapy) and 
research. To accomplish this, he has amassed and interpreted an almost encyclo- 
pediec cross-section of pertinent research in clinical psychology. The result is a com- 
prehensive textbook which truly reflects a new era in the development of the field. 
Prior to this date, clinical textbooks largely have been rehashes of the fields of ab- 
normal psychology, psychoanalysis and psychiatry, with clinical methods of 
diagnosis and therapy being treated descriptively, and without much attempt or 
success in relating practical applications to systematic theory. This book succeeds 
remarkably well in bridging the gap between basic science psychology and practical 
clinical applications, citing the major developments which contribute to the most 
up-to-date practices. 

It is somewhat paradoxical that in the organization of the book, the author 
discusses the theory and practice of psychological counseling in roughly the first half 
of the book while leaving evaluation and assessment to the latter half. This arrange- 
ment may reflect either the author’s own interests and strengths, or the fact that 
clinical psychology itself in recent years perhaps has moved ahead faster in the field 
of therapy than in diagnosis. We regard it as a serious defect to discuss therapy 
ahead of diagnosis, not only because students may conclude that therapy can be 
accomplished without much diagnostic preparation, but mainly because we feel that 
all rational counseling must depend on valid diagnosis which should be given prior 
consideration. In any case, the first half of the book on counseling is much stronger 
and better integrated with current basic science theory than the last half on evalua- 
tion and assessment which tends to fall back on the old practice of simply describing 
the most commonly used tests. 

The major defects of the parts on evaluation and assessment are that the 
treatment largely fails to interpret the individual tests in terms of the accumulating 
mass of factor analysis research and findings. The theory and practice of psycho- 
diagnosis are being revolutionized currently by the flood of new information com- 
ing from factorial research. The basic coordinates of psychological space have now 
been mapped out by factorial research as large numbers of factors of intelligence and 
personality have been identified and related to each other. This means that in order 
to be valid, the theory and practice of clinical psychology must relate itself to known 
factors to increasing degree, and that it is no longer acceptable to merely give des- 
criptive citations to existing tests and methods. 

It will take many decades before clinical psychology is able to separate the 
wheat from the chaff in discriminating what is actually valid in the hodge-podge of 
currently available methods. This book represents one of the first major attempts 
to rework the whole field of clinical psychology towards newer directions. Un- 
fortunately, the field is evolving so rapidly that even the best of books are in danger 
of being out-dated almost before they can be gotten into print. It is to be hoped that 
future authors will carry on in pioneering spirit where this book leaves off. 


Hurt, Max L. and Gipsy, Ropert G. The Mentally Retarded Child. Boston: Allyn 
and Bacon, 1958, pp. 334. $4.50 
This is an elementary textbook discussing the development, education and guid- 
ance of the mentally retarded child. It is suitable as an introductory text for teach- 
ing or as a reference book for laymen. 
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by J. E. WALLACE WALLIN, Ph.D. 


This important book summarizes the author’s life-long experience in dealing with 
practical social problems relating to the control of mental defectives. Dr. Wallin bases his 
authoritative opinions upon an intensive review of the literature in this field and his con- 
clusions reflect the latest scientific information. This book differs from other works on 
mental deficiency in that it deals with many practical issues for which society urgently 
demands some answers. It should be read by all psychiatrists, psychologists, social work- 
ers, educators, parents and other workers in the field. 
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Selection and arrangement of blots made on the basis of evidence of character, 
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Person Perception 


and Interpersonal Behavior 
Edited by RENATO TAGIURI and LUIGI PETRULLO 


' The ways in which one person is aware of what another person feels, 
wants, or does have been among the least-explored areas of psychology, 
yet they are at the heart of many problems in enterpersonal behavior. 
This is the first book in English to deal exclusively with these matters. 
The scattered periodical literature that has dealt with person perception 
has been particularly sparse in theoretical contributions. But here, with 
contributions by a distinguished body of psychologists, is a work that has 
systematically marked out areas of theory as well as of research. With 
the exception of two classic articles here reprinted, the 23 papers were 
prepared for a symposium on person perception held at Harvard Uni- 
versity in 1957. $7.50 


Psychotherapy 


by Reciprocal Inhibition 


JOSEPH WOLPE, M.D. 


The quick and lasting results which Dr. Wolpe has obtained with the 
pioneer method of psychotherapy described in this book have already 
begun to attract the attention of psychiatrists and psychologists every- 
where. Reciprocal inhibition is the removal of the patient’s anxiety by 
the application of proved principles of learning and motivation and is 
the first system of psychotherapy that is a direct product of modern 
learning theory. Part I presents the theoretical, experimental, and 
clinical material that forms the background to this approach. Part II is 
devoted to a consideration of the techniques of therapy based on the 
reciprocal inhibition principle. The book concludes with a statistica! 
analysis of 210 patients treated who have been either apparently ¢ :re¢ or 
much improved after a median of just over 20 interviews. #5.00 
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NEW BOOKS 
in this field 


THE MORAL THEORY OF BEHAVIOR (Third Printing). By Frank R. Barta, 
Omaha. “A New Answer to the Enigma of Mental Illness’’ is the sub-title of this chal- 
lenging little book which offers a new theory of behavior based on Aristotelian and 
Thomistic principles of philosophy and psychology. In reviewing the book, the Digest 
of Neurology and Psychiatry said “it has a religious emphasis for which there is ample 
room in the field of psychiatry.”’ ‘. . . not meant for casual reading but rather to be 
studied, read and re-read for it offers a great deal of food for thought.’’—The Journal of 
the Assn. for Physical and Mental Rehabilitation. Pub. °57, 45 pp., 3 il., Lexide, $2.00 


ELECTRONIC INSTRUMENTATION FOR THE BEHAVIORAL SCIENCES. By 
Clinton C. Brown and Rayford T. Saucer, Veterans Adm. Hosp., Perry Point, Md. 
By presenting electronic theory, circuit design and a wide variety of instrumentation 
techniques, this book provides the features of both a text and a handbook. For research 
investigators in experimental psychiatry, psychophysiology, physiology and other fields 
where electronic instrumentation is employed for stimulation or measurement of response 
processes. A wide variety of circuits are given, employing current technological advances, 
which are simple, practical and may be constructed with average workshop facilities. 
Pub. ’58, 176 pp., 73 il., Cloth, $5.50 


HYPNOGRAPHY. By Ainslie Meares, Melbourne. A study in the therapeutic use of 
hypnotic painting, a unique new aspect in the treatment of mental illness. The hyp- 
notized patient projects his conflicts in graphic form and the material thus obtained is 
used to help the patient to insight. The book contains reproductions of a large number of 
these paintings, furnishing dramatic evidence of the ventilation of unsuspected con- 
flicts. This work is not the result of an experimental research project but springs from 
careful clinical observation of patients actually undergoing treatment. Pub. °57, 28 pp., 
385 il., Cloth, $7.75 


MARRIAGE AND PERSONALITY: A Psychiatric Study of Inter-personal Re- 
actions for Students and Laymen. By Ainslie Mears, Melbourne. A dynamic 
presentation of the factors of personality and how they operate in marriage. Effectively 
demonstrates how personality can be developed in the marriage relationship. This is 
neither a sex book nor a religious guide to marriage. Based on case histories, it explains 
how the inter-personal reactions and counterreactions of marriage can work 
for the maturation of the personality. It should be on the bookshelves of psychia- 
trists, general practitioners, clergy, marriage guidance workers and social workers. 
Pub. ’57, 168 pp., Cloth, $3.95 
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THIS WE BELIEVE ... it is the sincere belief of The Devereux 
Foundation that every child possesses certain latent capabilities, 
and that means can be found for developing them to the fullest 
capacity. 


To that end, every phase of the Devereux student’s life is planned 
in the light of what is best for the individual child — a “multi- 
disciplined’ approach, calculated to release those capabilities. 


Children find at Devereux a therapeutic environment in which 
there is acceptance and love and satisfaction of their basic 
need of recognition as individuals. 


Professional inquiries should be addressed to 

John M. Barclay, Director of Development, 

Devereux Schools, Devon, Pennsylvania; 

western residents address Keith A. Seaton, Registrar, 
Devereux Schools in California, Santa Barbara, California. 
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